我需要解决方案
Hello, we had an enviroment with 15 media servers and about a month ago, all drives randomly go down in all media servers.
Perhaps, it throw a hardware flag but I think it is a false positive or the fact he is completly down (from Master to Media s.) he throw the HW fault flag.
Also, with heavy load like Vault or Staging after some time all drives will be down and without any use of drives he stay UP.
Unfortunatelly our NBU is 6.5.4.
Master server is Solaris 10 SPARC.
Media Servers are Win 2003/2008.
Drives are IBM ULTRIUM-TD3 93G0
Actually we didn't have any MISSING_PATH.
In ltid debug I only get the default "EMM/Operator down the drive" and lot of this:
10:06:33.001 [19492] <2> TAO: TAO (19492|1) - Transport_Cache_Manager::is_entry_idle_i, state is [0]
10:06:33.001 [19492] <2> TAO: TAO (19492|1) - Synch_Invocation::invoke_i, timeout on recv is <299999>
10:06:33.021 [19492] <2> TAO: TAO (19492|1) Synch_Invocation::invoke_i, timeout after recv is <299961> status <1>
10:11:33.976 [19492] <2> TAO: TAO (19492|1) Timeout is <300000>
10:11:33.976 [19492] <2> TAO: TAO (19492|1) Timeout is <0>
A lot of this exactly Hardware Flag:
TapeAlert Code: 0x1f, Type: Critical, Flag: HARDWARE B, from drive TLD.hcart3.3 (index 4), Media Id
Thank you.