Hey guys,
Just wondering, is anyone else here noticing odd temperatures on their servers? Mine were going up and down all night.
I was going through the logs and found these:
Jul 31 11:40:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 68 to 67
Jul 31 11:40:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 32 to 33
Jul 31 11:40:52 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 66 to 65
Jul 31 11:40:52 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 34 to 35
Looks good there? Yup.
However, a few hours later:
Jul 31 21:40:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 63 to 62
Jul 31 21:40:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 37 to 38
Jul 31 21:40:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 62 to 61
Jul 31 21:40:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 38 to 39
A little high. The maximum recommended drive temperature reported by smartd is 41C, so that's pretty close to the limit.
But then...
Aug 1 00:10:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 63 to 62
Aug 1 00:10:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 37 to 38
Aug 1 00:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 61 to 60
Aug 1 00:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 39 to 40
That's even closer to the limit...
And then, a sudden drop:
Aug 1 05:10:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 63 to 68
Aug 1 05:10:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 37 to 32
Aug 1 05:10:51 web smartd[2885]: Device: /dev/sdb [SAT], 8 Currently unreadable (pending) sectors
Aug 1 05:10:51 web smartd[2885]: Device: /dev/sdb [SAT], 8 Offline uncorrectable sectors
Aug 1 05:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 60 to 66
Aug 1 05:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 40 to 34
Aug 1 05:40:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 68 to 69
Aug 1 05:40:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 32 to 31
Aug 1 05:40:51 web smartd[2885]: Device: /dev/sdb [SAT], 8 Currently unreadable (pending) sectors
Aug 1 05:40:51 web smartd[2885]: Device: /dev/sdb [SAT], 8 Offline uncorrectable sectors
Aug 1 05:40:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 66 to 68
Aug 1 05:40:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 34 to 32
And currently:
Aug 1 10:10:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 63 to 64
Aug 1 10:10:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 37 to 36
Aug 1 10:10:51 web smartd[2885]: Device: /dev/sdb [SAT], 8 Currently unreadable (pending) sectors
Aug 1 10:10:51 web smartd[2885]: Device: /dev/sdb [SAT], 8 Offline uncorrectable sectors
Aug 1 10:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 62 to 63
Aug 1 10:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 38 to 37
So yeah, temperatures seem to be all over the place.
Is anyone else noticing this? The server in question is under low I/O load right now.
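
In case anyone wants to compare against their own logs, here is a rough Python sketch that pulls the attribute 194 readings out of smartd lines like the ones above and reports the lowest and highest value seen per drive. The function name `temperature_range` and the regex are my own, not part of smartmontools, and it assumes the logged 194 value really is the temperature in Celsius, which is how I've been reading it. The sample lines are copied from the logs in this post.

```python
import re
from collections import defaultdict

# Matches smartd "attribute changed" lines in the format shown in this post.
LINE_RE = re.compile(
    r"Device: (?P<dev>\S+) .*SMART Usage Attribute: (?P<id>\d+) "
    r"(?P<name>\S+) changed from (?P<old>\d+) to (?P<new>\d+)"
)


def temperature_range(lines, attr_id=194):
    """Return {device: (lowest, highest)} for one SMART attribute ID.

    Hypothetical helper: both the "from" and the "to" value of every
    matching line count as readings. Lines that are not attribute
    changes (for example the pending-sector messages) are skipped.
    """
    seen = defaultdict(list)
    for line in lines:
        m = LINE_RE.search(line)
        if m and int(m.group("id")) == attr_id:
            seen[m.group("dev")].extend([int(m.group("old")), int(m.group("new"))])
    return {dev: (min(vals), max(vals)) for dev, vals in seen.items()}


# A few lines taken from the log excerpts above.
sample = [
    "Jul 31 11:40:51 web smartd[2885]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 32 to 33",
    "Aug 1 00:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 39 to 40",
    "Aug 1 05:10:51 web smartd[2885]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 40 to 34",
    "Aug 1 05:10:51 web smartd[2885]: Device: /dev/sdb [SAT], 8 Offline uncorrectable sectors",
]

for dev, (low, high) in sorted(temperature_range(sample).items()):
    print(f"{dev}: {low} to {high}")
```

To run it on a real log, pass the lines of the file instead of `sample`, e.g. `temperature_range(open(path))`, where `path` is wherever your syslog keeps smartd messages.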