前几天我们公司的一个服务器宕机了,ping不通ssh连不上。只好让IDC机房工作人员帮我们重启我们的服务器。重启完之后赶紧查看日志,但是自身服务日志并没有报错。接下来就是分析硬件问题了。我们服务器是DELL的,经理让我安装了一个DELL 的检测工具。
Dell System E-Support Tool (DSET)这个工具可以用来收集服务器硬件信息,存储信息(RAID卡,硬盘等)。及linux 驱动,服务,网络设置等等,同时又包括CPU,memory, ESM log, BIOS/firmware versions and system health (fan/voltage levels).

下载地址:
http://support.dell.com/support/topics/global.aspx/support/en/dell_system_tool

Dell,System,E-Support,Tool

安装步骤

1 授予权限执行这个可执行文件
[root@www ~]# chmod +x delldset_v2.2.0.122_x64-A00.bin
[root@www ~]# ./delldset_v2.2.0.122_x64-A00.bin
 。。。。。。。。。。。。
 PARTICULAR PURPOSE, TITLE AND ANY WARRANTY OF NON-INFRINGEMENT.   YOU WILL
 USE THE SOFTWARE AT YOUR OWN RISK. DELL SHALL NOT BE LIABLE TO YOU FOR ANY
 DIRECT OR INDIRECT DAMAGES INCURRED IN USING THE SOFTWARE. IN NO EVENT SHALL
 DELL OR ITS SUPPLIERS BE RESPONSIBLE FOR ANY DIRECT OR INDIRECT DAMAGES
 Dell License (42%): Press spacebar to view next page, 'q' to proceed

 2,按q之后出现是否接受协议,直接按y

 DELL OR ITS SUPPLIERS BE RESPONSIBLE FOR ANY DIRECT OR INDIRECT DAMAGES

Do you accept the terms of this license? (y/n):

 3,按y之后出现如下提示
 Dell System E-Support Tool (DSET) Options:
Choose an option:
 1) Read DSET Release Notes First
      Show latest information concerning features and known issues

 2) Create DSET Report Only
      Creates a DSET report and saves it to user's home directory

 3) Clear ESM Hardware Log Only
      Only clears the ESM Hardware Log contents

 4) Install/Upgrade DSET Application
      Permanently installs or upgrades the DSET application for repeat use

Enter option (1-4) or 'q' to quit:

 4,选4安装

 Install Location:
 Where should DSET be installed?
 Default location: /opt/dell/dset   //默认程序安装位置
 Press Return to accept the default location or
 enter a new directory path:

Directory does not exist. Create? (y/n): y

Preparing...                ########################################### [100%]
    1:delldset               ########################################### [100%]

Installation of Dell System E-Support Tool (DSET) complete.
 Enter 'dellsysteminfo' from a terminal shell prompt to create a report file.

 5,查看帮助

 [root@www ~]# dellsysteminfo -h
 Dell System E-Support Tool
 @Copyright Dell Inc. 2004-2008 Version 1.6 build 135
 The given option is invalid: ['-h']
 Usage: dellsysteminfo [-options] [-f filepath/filename]

Options:
    -f              Specify a filename, a path using default filename, or both
    --nohardware    Skips collecting info for all hardware categories
    --nostorage     Skips collecting info for all storage categories
    --nosoftware    Skips collecting info for all software categories
    --nologs        Skips collecting any non-Linux log files
    --time          Append report filename with timestamp
    --silent        Accept defaults and prevent user prompting (for scripting)
--advanced      Collect various advanced logs (may create large report size!)

6 获取系统报告,-f 指定报告位置在/home/report.zip,这里会等一段时间,这是他正在检测系统硬件系统,存储系统和操作系统信息,检测完,/home/目录下回产生一个report.zip 就是我们要的报告

 [root@bogon ~]#dellsysteminfo -f /home/report

7 查看报告内容。
 

使用ssh工具把report.zip下载到我们本地计算机上,然后解压缩包,密码dell

Dell,System,E-Support,Tool

 

8 解压缩完了之后,双击dsetreport.hta打开报告内容

Dell,System,E-Support,Tool

9 系统总体概览

Dell,System,E-Support,Tool

10 硬件日志。这里看到我们的cpu有一个出问题了。

Dell,System,E-Support,Tool

11 下面这里我们硬件日志,这里我们看到,6月17日22:22:03首先在检测到设备0上有错误,接着就是6月18日21:50:55内存发生持久错误,中间重启过一次系统,正常了一段时间,有出错。最下面是从6月27号又开始出现错误,我们又重启了系统。

Dell,System,E-Support,Tool

12 以下这些是软件信息没有什么错误,是有关操作系统。下面这个是启动项信息

Dell,System,E-Support,Tool

13 驱动及模块信息

Dell,System,E-Support,Tool

14 开机启动过程信息

Dell,System,E-Support,Tool