Lazy Linux Admins Going to Server Rooms Less: Forced Reboot, Auto Reboot after Kernel Panic and Email Notification after Reboot

By Eric Ma · Feb 2, 2015 · Aug 30, 2020

Having to go to the server room to reset servers is the biggest headache for admins managing a cluster of Linux servers at a remote site. Either you can ping the server but cannot SSH to it, or you cannot even ping it. There are various reasons why a Linux server may crash or become unreachable over SSH. The two most common in my experience are a badly behaving process that uses up almost all physical memory and swap, and a kernel panic. In this post, I describe several techniques I have learned for dealing with these kinds of failures so that I go to the server room less often.

Force Linux to reboot even if you cannot start a shell via SSH

If the server is too busy, starting an interactive shell over SSH may fail even though sshd is alive. Sometimes you are lucky and can still execute a command remotely by passing it directly to ssh. In that case, you can use the magic SysRq key to force Linux to restart.

ssh root@server_home \
'echo 1 > /proc/sys/kernel/sysrq; echo b > /proc/sysrq-trigger'

Reference: Force Linux to reboot.

After this command, if your server disappears from the network, it is probably rebooting. Wait for a while and it should come back.
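
The b trigger reboots immediately, without syncing or unmounting disks. A gentler variant, sketched below under the assumption that the remote shell can still run a few more commands, first asks the kernel to sync (s) and to remount filesystems read-only (u). The helper name sysrq_reboot_cmd and the 2-second pauses are illustrative choices, not part of the original tip.

```shell
#!/bin/sh
# Hypothetical helper: print the remote command instead of running it,
# so it can be reviewed before it is sent to the server.
# SysRq keys used: s = sync disks, u = remount read-only, b = reboot now.
sysrq_reboot_cmd() {
    printf '%s' 'echo 1 > /proc/sys/kernel/sysrq; '
    printf '%s' 'echo s > /proc/sysrq-trigger; sleep 2; '
    printf '%s' 'echo u > /proc/sysrq-trigger; sleep 2; '
    printf '%s\n' 'echo b > /proc/sysrq-trigger'
}

# Usage (server_home is the host name used above):
#   ssh root@server_home "$(sysrq_reboot_cmd)"
```

If the server is so starved that even sleep cannot be started, the original one-step command is still the fallback.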

Make Linux reboot automatically after a kernel panic

Sometimes you are unlucky and the kernel panics. Almost everything, including the network, stops working and you cannot connect to the server any more. That is not good, but it need not be too bad if you did some homework beforehand by configuring Linux to reboot itself after a kernel panic.

Linux has a nice feature that makes it reboot itself after a timeout when a kernel panic happens. Usually, it is disabled. We can turn it on, as we are lazy system admins, by setting the kernel.panic kernel parameter.

For a running system:

# echo 20 >/proc/sys/kernel/panic

Here, 20 is the number of seconds before the kernel reboots. 0 means this feature is disabled.

To make the configuration persistent, you have at least 2 choices:

add the kernel parameter panic=20 to your bootloader (grub or grub2).
add kernel.panic = 20 to /etc/sysctl.conf.

I prefer the second method, which writes the configuration to /etc/sysctl.conf.
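
As a minimal sketch of the second method, the helper below adds the setting to a sysctl configuration file, or updates it if a kernel.panic line is already there. The function name set_panic_timeout and its optional second argument are assumptions made for illustration, and the in-place edit relies on GNU sed (sed -i).

```shell
#!/bin/sh
# Hypothetical helper: set "kernel.panic = <seconds>" in a sysctl config file.
# $1 = timeout in seconds, $2 = config file (defaults to /etc/sysctl.conf).
set_panic_timeout() {
    spt_seconds=$1
    spt_conf=${2:-/etc/sysctl.conf}
    # The pattern requires "=" right after "kernel.panic", so related keys
    # such as kernel.panic_on_oops are left alone.
    if grep -q '^[[:space:]]*kernel\.panic[[:space:]]*=' "$spt_conf" 2>/dev/null; then
        # Replace the existing line in place (GNU sed).
        sed -i "s/^[[:space:]]*kernel\.panic[[:space:]]*=.*/kernel.panic = $spt_seconds/" "$spt_conf"
    else
        # No existing entry: append one.
        printf 'kernel.panic = %s\n' "$spt_seconds" >> "$spt_conf"
    fi
}

# Usage, as root:
#   set_panic_timeout 20     # edit /etc/sysctl.conf
#   sysctl -p                # load the file into the running kernel
#   cat /proc/sys/kernel/panic
```

Running sysctl -p afterwards applies the file immediately, so the running system and the persistent configuration agree without a reboot.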

For more details, please check How to make Linux automatically reboot after a kernel panic.

Email notifications after Linux reboot

Auto reboot is good. It is even better if the server also notifies the admins after a reboot. The technique discussed at How to email admins automatically after Linux server starts makes the server send email notifications after reboots.

It uses an @reboot cron job and mailx. Add a crontab entry like

@reboot date | mailx -S smtp=smtp://smtp.example.com -s "`hostname` started" -r zma@example.com zma@example.com

For sending emails, you may use either an internal SMTP server ( https://www.systutorials.com/sending-email-using-mailx-in-linux-through-internal-smtp/ ) or Gmail's SMTP server ( https://www.systutorials.com/sending-email-from-mailx-command-in-linux-using-gmails-smtp/ ).
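
The cron entry above sends only the date. If a little more context in the email would help, the body can be built by a small function first. This is only a sketch: the name reboot_report and the choice of fields are assumptions, and uname -n is used in place of hostname because it is available everywhere.

```shell
#!/bin/sh
# Hypothetical helper: build the body of the "server started" email.
reboot_report() {
    printf 'Host:   %s\n' "$(uname -n)"   # same value that hostname prints
    printf 'Time:   %s\n' "$(date)"
    printf 'Kernel: %s\n' "$(uname -r)"   # shows which kernel came up
}

# Usage, with the same mailx options and addresses as the cron entry above:
#   reboot_report | mailx -S smtp=smtp://smtp.example.com \
#       -s "$(uname -n) started" -r zma@example.com zma@example.com
```

Saved as a script, this can be called from the @reboot entry in place of the date pipeline.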


4 Comments

chen says:

Jul 30, 2015 at 10:30 am

simply hire a intern and let him do these headache works

  Eric Zhiqiang Ma says:

  Jul 30, 2015 at 11:38 am

  Interns could be more productive.

Sunil Kumar Medi says:

Dec 10, 2015 at 11:47 am

What if the remote root ssh is disabled for security purpose ? How do you reboot remotely ?
Shouldn’t these critical server have remote power management tools like ILO by HP, DRAC by DELL.

I think remote power management tools are the best options in such conditions.

  Eric Zhiqiang Ma says:

  Dec 10, 2015 at 1:13 pm

  The tip here is for server management via ssh. Of course a piece of hardware independent of the software in the server is more reliable.
