zombie process in CLOSE_WAIT mode
Posted: Fri Jul 30, 2010 11:27 am
Subject: zombie process in CLOSE_WAIT mode
Gilles Devaux

Hi,

I sometimes run into an issue where a plugin (or at least a command run by a plugin) spins at 100% CPU in what looks like a zombie state.

CentOS 5.4 on Rackspace
[gilles@client ~]$ uname -a
Linux chefserver 2.6.32.1-rscloud #15 SMP Mon Feb 22 13:22:15 UTC 2010 x86_64 x86_64 x86_64 GNU/Linux

[gilles@client ~]$ /usr/sbin/munin-node --version
munin-node (munin-node) version 1.4.3.
Written by Audun Ytterdal, Jimmy Olsen, Tore Anderson / Linpro AS

[gilles@client ~]$ ps -elf | grep netstat
0 S gilles 11993 11972 0 100 0 - 15295 - 15:46 pts/2 00:00:00 grep netstat
1 R munin 27483 1 99 100 0 - 0 ? 12:30 ? 03:16:24 [netstat]

[gilles@client ~]$ top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
27483 munin 40 0 0 0 0 R 99.7 0.0 196:55.31 netstat

[gilles@client ~]$ sudo netstat -tpan|grep 4949
tcp 0 0 0.0.0.0:4949 0.0.0.0:* LISTEN 10749/munin-node
tcp 0 0 10.177.133.82:4949 10.177.140.74:47313 CLOSE_WAIT 27483/

[gilles@server ~]$ sudo netstat -tpan | grep 10.177.133.82
[gilles@server ~]$

munin-node is waiting for a close() but the server already finished running.

There is nothing relevant in munin-update.log.

I realize that I'm not giving much but has anyone ever encountered this issue?

Thanks

--Gilles
------------------------------------------------------------------------------
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm
Posted: Fri Jul 30, 2010 1:17 pm
Subject: zombie process in CLOSE_WAIT mode
Lupe Christoph

On Friday, 2010-07-30 at 09:26:58 -0700, Gilles Devaux wrote:

Quote:
[gilles@client ~]$ top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
27483 munin 40 0 0 0 0 R 99.7 0.0 196:55.31 netstat

This is very curious. This program should not go into an endless
loop. Please strace it to find out what it is doing.

Quote:
[gilles@client ~]$ sudo netstat -tpan|grep 4949
tcp 0 0 0.0.0.0:4949 0.0.0.0:* LISTEN 10749/munin-node
tcp 0 0 10.177.133.82:4949 10.177.140.74:47313 CLOSE_WAIT 27483/

Run ps -ef --forest to see if munin-node is waiting on the netstat you
showed above.

Quote:
munin-node is waiting for a close() but the server already finished running.

No, it is hanging and not doing its close while the munin server has
already closed its side of the connection. (Probably due to a timeout.)
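
A minimal sketch of the situation described above, written in Python rather than taken from munin-node's own code: one end closes the connection while the other end keeps its socket open. On Linux, the end that has not yet called close() is the one netstat reports as CLOSE_WAIT. The function name and the loopback setup are illustrative assumptions only.

```python
import socket


def peer_closed_but_local_open():
    """Return (data, still_open) for a socket whose peer has already closed."""
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.bind(("127.0.0.1", 0))      # port 0: let the OS pick a free port
    listener.listen(1)

    # "client" plays the munin server, "conn" plays the node side.
    client = socket.create_connection(listener.getsockname())
    conn, _ = listener.accept()

    client.close()                        # the peer closes first (sends FIN)

    conn.settimeout(5)
    data = conn.recv(1024)                # empty bytes means end-of-stream
    # Until conn.close() is called, this end of the connection stays
    # half-closed; this is the state netstat shows as CLOSE_WAIT.
    still_open = conn.fileno() != -1

    conn.close()                          # the call the hung process never makes
    listener.close()
    return data, still_open


if __name__ == "__main__":
    print(peer_closed_but_local_open())
```

A process stuck in a busy loop never reaches its own close(), so the kernel keeps the socket in CLOSE_WAIT for as long as the process lives.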

Quote:
I realize that I'm not giving much but has anyone ever encountered this issue?

Number Five needs more input ;-)
Lupe Christoph
--
| It is a well-known fact in any organisation that, if you want a job |
| done, you should give it to someone who is already very busy. |
| Terry Pratchett, "Unseen Academicals" |

Posted: Fri Jul 30, 2010 2:02 pm
Subject: zombie process in CLOSE_WAIT mode
Gilles Devaux

Thanks Lupe,

answers inline

On Jul 30, 2010, at 10:55 AM, Lupe Christoph wrote:

Quote:
On Friday, 2010-07-30 at 09:26:58 -0700, Gilles Devaux wrote:

Quote:
[gilles@client ~]$ top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
27483 munin 40 0 0 0 0 R 99.7 0.0 196:55.31 netstat

This is very curious. This program should not go into an endless
loop. Please strace it to find out what it is doing.

I did attach strace, but the output was empty; nothing was printed.

Quote:

Quote:
[gilles@client ~]$ sudo netstat -tpan|grep 4949
tcp 0 0 0.0.0.0:4949 0.0.0.0:* LISTEN 10749/munin-node
tcp 0 0 10.177.133.82:4949 10.177.140.74:47313 CLOSE_WAIT 27483/

Run ps -ef --forest to see if munin-node is waiting on the netstat you
showed above.

did that too :)
There was no parent process, not even init. That's why I thought of a zombie, but I have doubts now, because zombies are supposed to have finished and use no CPU. In any case, there was no parent process.
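
One way to check the zombie question directly is to read the state and parent PID from /proc/<pid>/stat, where a true zombie has state Z. The sketch below is only an illustration: the helper names are made up, and the sample line is hand-built from the PID and command name in this thread, not captured from the affected machine. For what it is worth, the ps -elf listing in the first post shows R in the S column and 1 in the PPID column.

```python
def parse_proc_stat(line):
    """Return (pid, comm, state, ppid) from one /proc/<pid>/stat line."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or ')', so split on the LAST ')' instead of on whitespace.
    head, _, tail = line.rpartition(")")
    pid_text, _, comm = head.partition("(")
    fields = tail.split()
    return int(pid_text), comm, fields[0], int(fields[1])


def looks_like_zombie(state):
    """A zombie (defunct) process is reported with state 'Z'."""
    return state == "Z"


# Hand-built sample line (hypothetical, truncated after a few fields).
SAMPLE_STAT = "27483 (netstat) R 1 27483 27483 0 -1 0 0 0 0 0 0 0"

if __name__ == "__main__":
    pid, comm, state, ppid = parse_proc_stat(SAMPLE_STAT)
    print(pid, comm, state, ppid, looks_like_zombie(state))
```

On a live system the same parser could be fed the contents of /proc/<pid>/stat for the suspect PID.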

Quote:

Quote:
munin-node is waiting for a close() but the server already finished running.

No, it is hanging and not doing its close while the munin server has
already closed its side of the connection. (Probably due to a timeout.)

right, my mistake

Quote:

Quote:
I realize that I'm not giving much but has anyone ever encountered this issue?

Number Five needs more input ;-)

I have rebooted the server (the only way to get rid of it).
This happens rarely. Can you think of anything more I should do the next time it happens?

Thanks a lot
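
As a possible starting point for next time, here is a hedged sketch that snapshots a few /proc entries for the stuck PID before rebooting. It is not something suggested in this thread. The function name and the choice of files are assumptions; /proc/<pid>/stack in particular usually needs root and may not exist on every kernel, so unreadable entries are recorded rather than treated as errors.

```python
import os

# Per-process files worth saving; any of them may be missing or unreadable.
PROC_FILES = ("status", "wchan", "stack", "cmdline")


def snapshot_process(pid, proc_root="/proc"):
    """Collect selected /proc/<pid> entries into a dict for later inspection."""
    base = os.path.join(proc_root, str(pid))
    report = {}
    for name in PROC_FILES:
        try:
            with open(os.path.join(base, name), "rb") as fh:
                raw = fh.read()
        except OSError as exc:
            report[name] = "<unreadable: %s>" % exc.strerror
            continue
        # cmdline is NUL-separated; make it printable.
        report[name] = raw.replace(b"\0", b" ").decode("utf-8", "replace").strip()

    # Map each open file descriptor to its target, e.g. "socket:[12345]".
    report["fd"] = {}
    try:
        fd_names = sorted(os.listdir(os.path.join(base, "fd")), key=int)
    except OSError:
        fd_names = []
    for fd in fd_names:
        try:
            report["fd"][fd] = os.readlink(os.path.join(base, "fd", fd))
        except OSError:
            report["fd"][fd] = "<unreadable>"
    return report


if __name__ == "__main__":
    for key, value in snapshot_process(os.getpid()).items():
        print(key, "=>", value)
```

The proc_root parameter exists only so the function can be tried against a copy of the files; on a live system the default applies.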


All times are GMT - 5 Hours