CHECKDISK
Disk check on a new server
When a new server is provisioned, verify that the hard disks are healthy before installing anything or transferring data. Connect to your server over SSH using the root password we provided.
- 1- Check the log records for fault reports.
cat /var/log/messages
Oct 6 11:27:02 CentOS-65-64-minimal rsyslogd: [origin software="rsyslogd" swVersion="5.8.10" x-pid="1265" x-info="http://www.rsyslog.com"] rsyslogd was HUPed
Oct 6 18:51:16 CentOS-65-64-minimal kernel: md: md2: resync done.
Oct 6 18:51:16 CentOS-65-64-minimal kernel: md: resync of RAID array md1
Oct 6 18:51:16 CentOS-65-64-minimal kernel: md: delaying resync of md0 until md1 has finished (they share one or more physical units)
Oct 6 18:51:16 CentOS-65-64-minimal kernel: md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
Oct 6 18:51:16 CentOS-65-64-minimal kernel: md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for resync.
Oct 6 18:51:16 CentOS-65-64-minimal kernel: md: using 128k window, over a total of 524224k.
Oct 6 18:51:16 CentOS-65-64-minimal kernel: md: resuming resync of md1 from checkpoint.
Oct 6 18:51:20 CentOS-65-64-minimal kernel: md: md1: resync done.
Oct 6 18:51:20 CentOS-65-64-minimal kernel: md: resync of RAID array md0
Oct 6 18:51:20 CentOS-65-64-minimal kernel: md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
Oct 6 18:51:20 CentOS-65-64-minimal kernel: md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for resync.
Oct 6 18:51:20 CentOS-65-64-minimal kernel: md: using 128k window, over a total of 16777088k.
Oct 6 18:51:20 CentOS-65-64-minimal kernel: md: resuming resync of md0 from checkpoint.
Oct 6 18:52:21 CentOS-65-64-minimal kernel: md: md0: resync done.
Oct 7 07:39:02 CentOS-65-64-minimal yum[8423]: Installed: 1:smartmontools-5.43-1.el6.x86_64
Disk failures usually produce log entries like these:
kernel: sidisk I/O error: dev 08:01, sector 1590410
kernel: SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 28000002
or
kernel: hde: read_intr: error=0x10 { SectorIdNotFound }, CHS=31563/14/35, sector=0
kernel: hde: read_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
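Instead of reading the whole log, the failure patterns shown above can be searched for in one pass. This is a minimal sketch: the function name scan_disk_errors is hypothetical, and the pattern list is an assumption drawn only from the example lines above, so extend it to match what your kernel actually logs.

```shell
# scan_disk_errors: hypothetical helper, not part of any package.
# Prints the log lines that match the failure patterns shown above.
# Usage: scan_disk_errors [logfile]   (defaults to /var/log/messages)
scan_disk_errors() {
    # -i: ignore case, -E: extended regex so "|" means "or"
    grep -iE 'I/O error|SCSI disk error|read_intr: (error|status)=' \
        "${1:-/var/log/messages}"
}
```

Run it as scan_disk_errors /var/log/messages. grep exits with a non-zero status when nothing matches, which here only means that none of these patterns were found.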
If there is a fault, cat /proc/mdstat gives a general overview of the RAID arrays.
- 2- Check the state of the RAID disks.
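In /proc/mdstat each array has a member map such as [UU], where an underscore, as in [U_], marks a missing member. A rough sketch for listing only the arrays with a missing member follows. The function name degraded_arrays is hypothetical, and the parsing assumes the usual two-line layout per array (an "mdN :" line followed by a line carrying the map).

```shell
# degraded_arrays: hypothetical helper.
# Prints the names of md arrays whose member map contains "_", e.g. [U_].
# Usage: degraded_arrays [file]   (defaults to /proc/mdstat)
degraded_arrays() {
    awk '
        # Remember the array named on the most recent "mdN :" line.
        /^md[0-9]+ :/ { name = $1 }
        # A map made only of U and _ with at least one _ means a member is missing.
        /\[[U_]*_[U_]*\]/ && name != "" { print name; name = "" }
    ' "${1:-/proc/mdstat}"
}
```

No output means no array reported a missing member at the time of the check.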
mdadm --detail /dev/md0
/dev/md0:
Version : 1.0
Creation Time : Mon Oct 6 10:27:42 2014
Raid Level : raid1
Array Size : 16777088 (16.00 GiB 17.18 GB)
Used Dev Size : 16777088 (16.00 GiB 17.18 GB)
Raid Devices : 2
Total Devices : 2
Persistence : Superblock is persistent
Update Time : Mon Oct 6 18:52:21 2014
State : clean
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Name : rescue:0
UUID : 52dc0599:4b657985:06efca2b:11083c89
Events : 22
Number Major Minor RaidDevice State
0 8 1 0 active sync /dev/sda1
1 8 17 1 active sync /dev/sdb1
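In the output above, the fields to look at are State and Failed Devices. The check can be scripted roughly as follows. The function name array_is_healthy is hypothetical, and treating the words "degraded" or "FAILED" in the State line as unhealthy is an assumption about how mdadm words that field.

```shell
# array_is_healthy: hypothetical helper.
# Reads `mdadm --detail` output on stdin. Succeeds (exit 0) only when a State
# line was seen, it does not mention degraded/FAILED, and Failed Devices is 0.
array_is_healthy() {
    awk '
        /^ *State :/          { seen = 1; if ($0 ~ /degraded|FAILED/) bad = 1 }
        /^ *Failed Devices :/ { if ($NF != 0) bad = 1 }
        END { exit (bad || !seen) }
    '
}
```

For example: mdadm --detail /dev/md0 | array_is_healthy && echo "md0 looks healthy"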
- 3- (optional) You can also check the disks with smartmontools.
yum -y install smartmontools
smartctl -d ata -H /dev/sdb
Response for a healthy disk:
smartctl 5.43 2012-06-30 r3573 [x86_64-linux-2.6.32-431.29.2.el6.x86_64] (local build)
Copyright (C) 2002-12 by Bruce Allen, http://smartmontools.sourceforge.net
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
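Example response for a disk with a problem. Note that the overall result can still read PASSED while an individual attribute is flagged FAILING_NOW: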
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
Please note the following marginal Attributes:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
190 Airflow_Temperature_Cel 0x0022 044 033 045 Old_age Always FAILING_NOW 56 (96 110 58 25)
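Because the overall result alone can hide a flagged attribute, a script should look at both. The sketch below reduces smartctl -H output to a single word. The function name smart_verdict and the OK/WARN/FAIL labels are hypothetical, and the parsing assumes the ATA output format shown above.

```shell
# smart_verdict: hypothetical helper.
# Reads `smartctl -H` output on stdin and prints OK, WARN or FAIL.
smart_verdict() {
    awk '
        # The last word of the result line is the verdict, e.g. PASSED.
        /overall-health self-assessment test result:/ { result = $NF }
        # Any attribute row marked FAILING_NOW counts as a warning.
        /FAILING_NOW/ { marginal = 1 }
        END {
            if (result != "PASSED") print "FAIL"   # also covers a missing result line
            else if (marginal)      print "WARN"
            else                    print "OK"
        }
    '
}
```

For example: smartctl -d ata -H /dev/sdb | smart_verdict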
- 4- Detailed test.
smartctl -d ata -t long /dev/sda
To view the test results:
smartctl -l selftest /dev/sda
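On a server with more than one disk, the two commands above can be wrapped so that every disk is covered. This is a sketch only: the helper names and the SMARTCTL override variable are hypothetical, and the device list is whatever you pass in. The long test runs on the drive itself, so the log shows the final result only after the test has finished.

```shell
# SMARTCTL is a hypothetical override; set SMARTCTL=echo for a dry run
# that prints the arguments instead of touching any disk.
SMARTCTL="${SMARTCTL:-smartctl}"

# start_long_tests: hypothetical helper; starts a long self-test on each device given.
start_long_tests() {
    for dev in "$@"; do
        "$SMARTCTL" -d ata -t long "$dev"
    done
}

# show_test_logs: hypothetical helper; prints the self-test log of each device given.
show_test_logs() {
    for dev in "$@"; do
        "$SMARTCTL" -l selftest "$dev"
    done
}
```

For example: start_long_tests /dev/sda /dev/sdb, then later show_test_logs /dev/sda /dev/sdb.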