84VPS宕机将近两天

5月8日5点左右,我的84VPS莫名出现错误,直到当天10点左右我才发现,马上登陆SSH想重启,结果连SSH也登陆不进去,网页后台也打不开,所有的IP都ping不通。

15点左右,马上写了封ticket给84,内容是这样的:

My server does not open.

84回复我说:

Hello,

This seems to have been suspended due to heavy resource abuse.

vzctl exec 3042189 pstree
init-+-crond
|-memcached—5*[{memcached}]
|-mysqld_safe—mysqld—{mysqld}
|-nginx—nginx
|-4*[php-cgi]
|-pstree
|-pure-ftpd—2*[pure-ftpd—pure-ftpd]
|-rsyslogd—3*[{rsyslogd}]
|-saslauthd—saslauthd
|-2*[sendmail]
|-sshd
|-udevd
`-xinetd

看不太懂,觉得大概意思就是我的VPS占用了太多资源,把我给关掉了。疑似我用的站群软件不停发布文章,OpenVZ的VPS本来就是共用资源的,可能当时发的时候,84没有发现是我占用了资源,而且我的探针也显示只占用内存到60%左右,这种探针估计探出的是这台服务器整体的内存使用情况。

没办法,我只得现发ticket,请求重启服务器。

Please help me to restart the system, I can not login to SSH.

84回复我说:

We can only reactivate the server for one hour for you to resolve this issue. Please confirm that you will be able to do so in that time frame.

大概意思是:你它丫的占太多资源了,老子一小时后重启服务器,你做好准备,准备好了告诉我,我重启后再观察你的表现,表现好了开放。表现不好再把你关了。

然后,我苦等了5小时,不见重启,再写ticket去问,原来是要我准备好了告诉他们一声,于是我发:i’m ready!过了一会儿重启了,我可以登陆SSH了。马上打开网站看,都算正常。

然后84又说:

Hello,

Your server has been put online temporarily for one hour while you try to resolve this issue. If we do not hear back from you in this time frame your server will be taken back offline. Please update this ticket letting us know what was done to resolve this issue.

大概意思就是跟上面一样:你看看网站什么问题,找出问题了告诉我们一声。不然照样关你。

我当然不知道什么问题啦,所以我回:

I updated the site patches. VPS should be normal.

然后开了一会儿,我又不能登陆了。又被关掉了。84说:

Hello,

Sorry for any inconveniences this may cause you. However, your vps is still causing a high load on the entire node. If this is not resolved immediately your server will be suspended again.

<root – ~> vzctl exec 3042189 pstree
init-+-crond
|-memcached—5*[{memcached}]
|-mysqld_safe—mysqld—6*[{mysqld}]
|-nginx—nginx
|-php-cgi—5*[php-cgi]
|-pstree
|-pure-ftpd
|-rsyslogd—3*[{rsyslogd}]
|-saslauthd—saslauthd
|-2*[sendmail]
|-sshd
|-udevd
`-xinetd

This is most likely related to the memcached processes running on the server.

第二封说:

Hello,

Sorry for any inconveniences this may have caused you. Your server has been taken back offline as this was still causing issues for multiple clients.

我看了后,简直是彻底失望了,大概意思就是说你还是占用了太多资源,不得不把你关掉。等你找出问题所在并解决后,再写信告诉我,我给你放开!

操!!!老子连SSH都登陆不上,怎么发现问题?怎么解决问题啊?

心灰意冷,喝了四瓶啤酒,郁闷地去睡觉了。

结果今天早上打开一看,网站能打开,看宕机记录,是5月9日1点多恢复的。难道是84发现其实也不占太多资源了?难道是昨天的ticket问题?

唉,不管了。反正这次我得小心为好了,站群软件更新,得一个站一个站地更了。上一次的IX空间,就是被我搞废了,这次的84VPS不要再搞废了。总之,肯定是站群软件的问题,这么多线程同时在发内容,服务器不挂掉不行呢。

 

 

更多

Leave a Comment

Your email address will not be published.