Last week I ran into a very challenging issue on a Microsoft Windows Server 2003 R2 database cluster.
We had planned some maintenance on the cluster. After we were done, we found that not all cluster resource groups would work on one of the nodes.
After investigating, it turned out that those resource groups had not worked on that node since sometime in October last year!
Problem:
Cluster resources could not be brought online on one specific cluster node. The same resources worked fine on the other nodes.
Only the Physical Disk cluster resource failed to come online; the IP Address and Network Name resources came online without problems.
Cause:
It seems someone had been playing around on the cluster node with a tool named “TrueCrypt”.
This tool claims drive letters for the volumes it mounts. When that happens on a passive cluster node, where the drive letter is assigned but not in use, it corrupts the drive assignment on that node.
Solution:
Following these steps solved my issue, since they basically re-create the disk association on the cluster node:
Lesson learned:
Test all cluster resources on all cluster nodes before you begin maintenance on a cluster.
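
One way to make that test repeatable is to generate a move command for every combination of resource group and node, then run them one at a time and check that each group comes online. The sketch below only builds and prints the command lines and does not execute anything. The group and node names are made-up placeholders, `build_move_plan` is a hypothetical helper name, and it assumes the `cluster group <name> /moveto:<node>` syntax of the cluster.exe command-line tool, which you should verify on your own system first.

```python
from itertools import product


def build_move_plan(groups, nodes):
    """Return one cluster.exe argument list per (group, node) pair.

    Assumption: 'cluster group <name> /moveto:<node>' moves a resource
    group to the given node. Nothing is executed here.
    """
    plan = []
    for group, node in product(groups, nodes):
        plan.append(["cluster", "group", group, f"/moveto:{node}"])
    return plan


def format_command(args):
    """Quote arguments that contain spaces so the line can be pasted into a shell."""
    return " ".join(f'"{a}"' if " " in a else a for a in args)


if __name__ == "__main__":
    # Placeholder names; replace them with your own groups and nodes.
    groups = ["SQL Group", "Cluster Group"]
    nodes = ["NODE1", "NODE2"]
    for command in build_move_plan(groups, nodes):
        print(format_command(command))
```

Running the moves one at a time, and checking the state of every resource in the group after each move, would have shown the failing Physical Disk resource long before the maintenance window.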
