|
|
|
zombie process in CLOSE_WAIT mode
|
Previous Topic
Next Topic
|
| Message |
Author |
Posted: Fri Jul 30, 2010 11:27 am Subject: zombie process in CLOSE_WAIT mode |
|
|
Gilles Devaux
|
|
Hi,
I'm running into an issue sometimes where a plugin (at least a command run by a plugin) would run 100% cpu in zombie mode
CentOs 5.4 rackspace
[gilles@client ~]$ uname -a
Linux chefserver 2.6.32.1-rscloud #15 SMP Mon Feb 22 13:22:15 UTC 2010 x86_64 x86_64 x86_64 GNU/Linux
[gilles@client ~]$ /usr/sbin/munin-node --version
munin-node (munin-node) version 1.4.3.
Written by Audun Ytterdal, Jimmy Olsen, Tore Anderson / Linpro AS
[gilles@client ~]$ ps -elf | grep netstat
0 S gilles 11993 11972 0 100 0 - 15295 - 15:46 pts/2 00:00:00 grep netstat
1 R munin 27483 1 99 100 0 - 0 ? 12:30 ? 03:16:24 [netstat]
[gilles@client ~]$ top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
27483 munin 40 0 0 0 0 R 99.7 0.0 196:55.31 netstat
[gilles@client ~]$ sudo netstat -tpan|grep 4949
tcp 0 0 0.0.0.0:4949 0.0.0.0:* LISTEN 10749/munin-node
tcp 0 0 10.177.133.82:4949 10.177.140.74:47313 CLOSE_WAIT 27483/
[gilles@server ~]$ sudo netstat -tpan | grep 10.177.133.82
[gilles@server ~]$
munin-node is waiting for a close() but the server already finished running.
Nothing relevant in the munin-update.log
I realize that I'm not giving much but has anyone ever encountered this issue?
Thanks
--Gilles
------------------------------------------------------------------------------
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm |
| Back to top |
|
 |
Posted: Fri Jul 30, 2010 1:17 pm Subject: zombie process in CLOSE_WAIT mode |
|
|
Lupe Christoph
|
|
On Friday, 2010-07-30 at 09:26:58 -0700, Gilles Devaux wrote:
| Quote: | [gilles@client ~]$ top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
27483 munin 40 0 0 0 0 R 99.7 0.0 196:55.31 netstat
|
This is very curious. This programm should not go into an endless
loop. Please strace it to find out what it is doing.
| Quote: | [gilles@client ~]$ sudo netstat -tpan|grep 4949
tcp 0 0 0.0.0.0:4949 0.0.0.0:* LISTEN 10749/munin-node
tcp 0 0 10.177.133.82:4949 10.177.140.74:47313 CLOSE_WAIT 27483/
|
Run ps -ef --forest to see if munin_node is waiting on the netstat you
showed above.
| Quote: | munin-node is waiting for a close() but the server already finished running.
|
No, it is hanging and not doing its close while the munin server has
already closed its side of the connection. (Probably due to a timeout.)
| Quote: | I realize that I'm not giving much but has anyone ever encountered this issue?
|
Number Five needs more input ;-)
Lupe Christoph
--
| It is a well-known fact in any organisation that, if you want a job |
| done, you should give it to someone who is already very busy. |
| Terry Pratchett, "Unseen Academicals" |
------------------------------------------------------------------------------
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm |
| Back to top |
|
 |
Posted: Fri Jul 30, 2010 2:02 pm Subject: zombie process in CLOSE_WAIT mode |
|
|
Gilles Devaux
|
|
Thanks Lupe,
answers inline
On Jul 30, 2010, at 10:55 AM, Lupe Christoph wrote:
| Quote: | On Friday, 2010-07-30 at 09:26:58 -0700, Gilles Devaux wrote:
| Quote: | [gilles@client ~]$ top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
27483 munin 40 0 0 0 0 R 99.7 0.0 196:55.31 netstat
|
This is very curious. This programm should not go into an endless
loop. Please strace it to find out what it is doing.
|
I did attached strace but the output was empty, nothing printed
| Quote: |
| Quote: | [gilles@client ~]$ sudo netstat -tpan|grep 4949
tcp 0 0 0.0.0.0:4949 0.0.0.0:* LISTEN 10749/munin-node
tcp 0 0 10.177.133.82:4949 10.177.140.74:47313 CLOSE_WAIT 27483/
|
Run ps -ef --forest to see if munin_node is waiting on the netstat you
showed above.
|
did that too :)
There were no parent process, not even init, that's why I thought of zombie but I have a doubt now because zombies are supposed to have finished, no CPU used. In any case there was no parent process.
| Quote: |
| Quote: | munin-node is waiting for a close() but the server already finished running.
|
No, it is hanging and not doing its close while the munin server has
already closed its side of the connection. (Probably due to a timeout.)
|
right, my mistake
| Quote: |
| Quote: | I realize that I'm not giving much but has anyone ever encountered this issue?
|
Number Five needs more input ;-)
|
I have rebooted the server (only way to get rid or it)
This is happening rarely, do you think of something more to do next time this happens?
Thanks a lot
| Quote: | Lupe Christoph
--
| It is a well-known fact in any organisation that, if you want a job |
| done, you should give it to someone who is already very busy. |
| Terry Pratchett, "Unseen Academicals" |
------------------------------------------------------------------------------
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm
|
------------------------------------------------------------------------------
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm |
| Back to top |
|
 |
|
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
Powered by phpBB © 2001, 2005 phpBB Group
|