NASA on .NET Rocks!#

Just finished recording a new episode of .NET Rocks! My second as co-host.

On this week's show we interviewed Chris Maxwell and Randy Kim from who work at NASA's Ames Research Center on a product called WorldWind. Its similar to Google's Keyhole, but its free (well, paid for by the US taxpayer), and its got a stronger educational bent. Essentially they've gathered together lots of different bits of satellite data that you can use to explore the planet with. Very, very cool. And all written in C# and the .NET Framework 1.1!

The chat room tonight was really cool - plus the WorldWind folks have their own chat channel as well, so we had lots of intermingling between the groups.

Even Robert Scoble showed up and hey, we got Scobleized!

 

Friday, February 25, 2005 10:02:21 PM (Pacific Standard Time, UTC-08:00) #    Comments [5]  | 

 

Rebuilding Cartman...#

I've been slowly working my way through the server rack, upgrading all of my servers. Some of the machines are as much as five years old, and all spinning gear (CPU fans, case fans, hard drives) are essentially ticking time bombs. In addition there is new hardware to be added to the rack, which means virtually everything in the rack has to move... the new configuration with eight servers completely fills the 30U rack.

What makes this especially challenging is that they ARE servers... they're constantly in use. I can take them down for a few minutes, but after a half hour the phone starts to ring. However, some servers are more sensitive to this than others - and Cartman is one of the least sensitive, since its largely an internal-only server.

Cartman has a variety of tasks. Primarily he's a file server, but also a domain controller (one of two), DHCP and DNS server. As a file server, he has a 400GB RAID array... doesn't sound like much, but I built it in October of 2001. Its done with a Promise SX6000 controller and six 80GB hard drives. At the time, it was a monster. Since its essentially been on since it was first built those drives have over 30,000 hours of spin time... very scary.

Before tearing Cartman apart I used Acronis True Image to image the boot drives, and I backed the entire 400GB drive array up on a single external USB 400GB drive. And yes, I used xcopy with verify and double checked everything before I tore it down.

This is what I saw after hauling Cartman out of the rack and popping the cover. Essentially identical to what I saw in October 2001 - one crammed case. You can see the six ATA/100 ribbon cables coming out of the Promise controller running to the two three drive caddies holding the 80GB drives. In the middle are the two 17GB SCSI drives that are used as boot drives, which, along with the SCSI DVD drive are run from the Adaptec 29160 SCSI controller. Oh, and an Exabyte external tape drive plugs in there too.

Disassembly of this beast starts with the metal bar running across the case that also supports the two SCSI hard drives (and a fan). Then the entire front drive array holding the DVD, floppy and two drive caddies was removed. Both the SCSI and RAID controllers were pulled as well, leaving the case pretty darn bare. With everything out I powered up the machine just to take a look and noticed that one of the CPU fans was barely spinning any more. I had planned on replacing them anyway, this was just extra incentive.

However, the motherboard is so busy that the fancy new Socket 370 cooling blocks I bought wouldn't even fit in the space! But I was able to use the old blocks by removing the worn out fans with the the fans from the new blocks.

After a thorough cleaning, I installed a gigabit network card and began the rest of the reassembly. I'm retiring the Promise controller altogether, going to a SATA array using six Hitachi Deskstar 7K400 drives. Yep, that's right... from a 400GB array to 400GB drives, for a total of two terabytes! And to drive this puppy, I'd need a SATA controller, so I went back to Adaptec for their 2810SA controller.

It actually supports eight drives, but I only had space for six, you can see the controller hard and new caddies to hold the drives. SATA cables are much tidier than ATA cables, so I got a bunch of space back in the case.

Here you can see the Chenbro caddies with three SATA cables a peice. There's one power plug for all three drives (which is very nice) and it also has a heavy blower fan pumping directly onto the drives.

The old 17GB Atlas V drives are replaced with shiny new 147GB Atlas 10Ks. More disk space!

With everything crammed back in the case, it was time to get things set up. Even before I started the install of Windows 2003 server I wanted to get the array set up. What was interesting is that every card installed in the machine had a boot BIOS in it - the SCSI controller, the RAID controller AND the gigabit network card! Getting the BIOS set up to boot from the right device took some fiddling.

Then I decided to start the array configuration from the BIOS, so I set up a RAID 5 array. Being a dilligent geek, I went to the Adaptec web site to check for latest drivers, BIOS updates, and so on. Adaptec had updates for both the 2810SA and the 29160, so I updated both BIOSes. What's stunningly annoying is that you HAVE to install BIOS updates from a floppy. The software is hard coded to read from drive A and nowhere else. Presumably I could set up a USB drive to do this, but this old SuperMicro motherboard ain't that smart.

I was glad I'd checked all this in advance, all over the readme files for the firmware were warnings that doing these upgrades would destroy the existing arrays, and you'd need to back everything up. Since I had nothing on the drives, I had nothing to fear.

Feeling smug with all my firmware flashed, I headed off into the BIOS set up for the 2810SA to get my spiffy new drive array configured. Apparently I did it wrong because I selected “Clean” to start the array rather than “Build/Verify.”

But I didn't know this at the time - off it went, ticking away to itself. I thought it might take a long time to set up a two terabyte array, but it was done in about 15 minutes... well, almost done. It got to 99% and then said “Controller Kernel Stopped Running!” And then the machine would reboot. That didn't seem good.

Every time I restarted the machine and went back into the 2810SA BIOS, I'd get the same error and reboot the machine.

In an effort to be positive about my situation, I ignored the failure and moved on - set up Windows 2003 Server. Once it was up and running, I tried to install the drivers for the controller card, but it wouldn't recognize it. That can't be good either. I filed a tech support request with Adaptec, but wouldn't hear back for 48 hours: by then I would solve it on my own.

I went to bed late, very grumpy. The next morning I woke up thinking maybe the firmware update was a mistake. So I reverted - got the old firmware, set up new floppies and attempted to install it. But it kept failing with the same error. Couldn't revert.

Then, a flash of insight, I realized what was happening to the controller - it was crashing! And right at the point of completing the array. After it rebooted, the controller would restart, see the array almost finished configuring and attempt to finish it... crashing the controller again! So, how to stop the array from rebuilding? Pull all the hard drives out! That'll slow the bugger down.

Sure enough, as soon as I pulled the drives, I was able to revert the firmware. Why I still reverted the firmware, I'm not sure - I guess I had a course in mind and thinking wasn't going to divert it. With the firmware reverted, the array had died, so when I plugged the drives back in, nothing bad happened.

Now afraid of the BIOS configuration stuff, I booted back into Windows, and reverted the driver as well to match the firmware. If you've never done this, you're a happier person than me: reverting to an older driver is a bugger. Windows 2003 Server has a rollback driver option, but it doesn't work if you haven't previously installed the older driver. So I had to do this the hardware - uninstall the driver and then carefully locate all the backup copies of the DLLs and kill them by hand. Once I had it all, installing the old driver worked, AND it came up just fine.

Now I was able to set up the RAID 5 array from Adaptec's client for Windows, which was a whole bunch clearer about the right ways to do things. And that's when I discovered that correctly building a two terabyte array takes an entire day.

The next day I discovered that my two terabyte array is actually a 1.8TB array. And that Windows understands TB, it displays that way in Windows Explorer. Funny, huh? I wonder if they have PB (as in petabyte, a thousand terabytes) in there as well.

The rest of the set up was uneventful, really... things got loaded back on, DHCP and DNS configured, and so on. The next level of excitement would come with the most dangerous update of all... converting an Exchange 2000 server to 2003!

Wednesday, February 23, 2005 6:52:24 PM (Pacific Standard Time, UTC-08:00) #    Comments [8]  | 

 

Rack Attack!#

Well, I finally broke down and started to rework my racks. I've literally avoided pulling them for more than a year, just patching things together whichever way I could. Take a look at the mess they were in before I started:

Several highlights of this mess I call my racks... notice the two bars poking out the front, those are the rails that the entire rack slide out on. Notice that between the two racks there's a new server (named “Tweak”) that has been sitting like that for six months. And notice the freakshow of a wiring mess as I've added VOIP boxes, a new router, new wireless access point (sitting on top of Tweak), and so on. Hey, its been more than a year!

The racks themselves are 30U Middle Atlantic AXS racks. The left hand one is for networking, it has a cable channel mounted on the left side for all the wiring. On the right is the server rack, which I had modified to be 30 inches deep instead of the standard 20 inches that Middle Atlantic makes for these racks. They're intended for stereo equipment, I use them for the computer gear because this way the server closet is much smaller - you don't need room to walk around it.

This is the rack pulled out onto the rails and ready for some service work. You can see the cable channel clearly now.

From the other side you can see the mess of wiring strung between the two racks... and the mess of wire in the back. Its not as bad as it looks (which is good, it looks pretty bad). Notice also the “wall-shaker“ style air conditioner that keeps the whole closet cool.

Besides the tangled mess of wiring, I also needed to add more power plugs, re-arrange some components, add new gigabit switches and additional wiring between the two racks.

A couple of hours later, the mess of wires is gone from the rack. This shot also shows the new double-sided power bar I added at the back to give myself more outlets, and the Oregon Scientific wireless temperature sensor (reading 71.6F) that lets me know the temperature inside the closet. Normally its about 68F in there. There are alarms if it climbs above 75F. Also, this gives you a pretty good look at the folding arms that hold the rack from sliding off the end of the rails, and provide a channel to route the wires on and off the rack.

Here's the beauty shot of the network rack reconfigured and back in the closet. Here's an inventory (from top-to-bottom):

  • Gear shelf contains
  • Xincom 603 Dual WAN NAT router
  • Linksys SR2024 24 port Gigabit switch
  • 2U cable tray
  • 2U 48 port Ethernet patch panel
  • 2U cable tray
  • Linksys SR224G 24 port 10/100 switch (with Gigabit uplink)
  • 1U Keyboard/Mouse/Monitor console
  • Cisco 3620 (mounted backwards)
  • 3U 48 port keystone patch panel (telephone and cable patches)
  • The old Nexland dual WAN NAT router
  • 5U gap (more UPSes will go in here in the future)
  • 1U power bar
  • 3U Hewlett-Packard rack-mount oscilloscope (long story)
  • 2U Minuteman 1000VA UPS (cut off in the photo)

That one bright green Ethernet cable you see in the shot is the patch cable for Tweak, the server still sitting on its side between the racks. I ran a new patch for it through the rack properly.

Next up, the server rack! And believe me, the network rack was the easy part of this whole process.

Tuesday, February 15, 2005 7:41:09 PM (Pacific Standard Time, UTC-08:00) #    Comments [7]  | 

 

All content © 2023, Richard Campbell
On this page
This site
Calendar
<June 2023>
SunMonTueWedThuFriSat
28293031123
45678910
11121314151617
18192021222324
2526272829301
2345678
Archives
Sitemap
Blogroll OPML
Disclaimer

Powered by: newtelligence dasBlog 1.9.7067.0

The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way.

Send mail to the author(s) E-mail

Theme design by Jelle Druyts


Pick a theme: