| | Home > News > Tech OnTap |
|
|
The NetApp Kilo-Client
A 1,500-node diskless enterprise server farm running iSCSI and FCP.
David Brown and Gregg Ferguson Engineering Support Manager and Laboratory Administrator, NetApp Engineering
David and Gregg helped design, build, and manage the NetApp Kilo Client. This project is one of several firsts for them. As an engineer at Lawrence Livermore labs, David was a founding member of the CIAC, Computer Incident Advisory Council. David also holds a patent for 10 Gbps mobile data center design and helped start the Engineering Support program at the Research Triangle Park (RTP), North Carolina NetApp office. Gregg, a 22-year industry veteran, held a variety of engineering, systems administrator and IT manager positions before joining NetApp in 2000. After opening the original NetApp RTP facility, Gregg spent over four years in the field as a systems engineer before returning to the NetApp Engineering IT team.The below article was originally published in the March 2006 edition of the Tech OnTap newsletter. The article was updated in December of 2006 to reflect the addition of 380 blades (1,500 total) and FC SAN support. To receive the Tech OnTap newsletter and enjoy other great benefits, sign up today. Over the past few years, a variety of new technologies have emerged that promise to change the way data centers are designed and managed. Among these are large computing grids with hundreds or thousands of hosts; The Kilo-Client Architecture Supports Multiple OS
In early 2006, NetApp Engineering combined these technologies to build what we believe is the world's largest iSCSI-based diskless server farm. All server nodes boot from NetApp storage using either iSCSI or Fibre Channel SAN and can be rapidly rebooted to run Linux®, Windows®, or other operating systems. The system leverages NetApp FlexClone technology to rapidly create system images without making full physical copies of those images. The baseline image is shared among all of the nodes of the farm; only host-specific "personalization" needs to be added to the core image for each of the provisioned servers. This unique approach affords near-instantaneous image provisioning with near-zero footprint (only the blocks of the images that differ need to be added to the storage system, which keeps track of the individualized images). The Kilo-Client project began in 2005, when the NetApp Engineering IT team was challenged to create a testing environment capable of meeting or exceeding the most extreme conditions NetApp products encounter in the field. The NetApp team designed a 1,120-blade server farm and originally planned to use diskfull server blades in which each blade booted from a local disk. However, as the project progressed, it became clear that the time and administrative overhead required to copy a boot image to over a thousand local disks would result in more time spent configuring and managing the cluster than running actual tests. Instead, the team chose a diskless approach. Although they weren't aware of anyone using iSCSI-based SAN boot for 1,000+ nodes, earlier that year Gregg had helped deploy a 250-node diskless environment based on FC SAN technology. The team also had access to internal expertise managing a large grid environment (a 400+-node diskfull testbed already in use at the NetApp Pittsburgh Technology Center). The result is the NetApp Kilo-Client, a 1500-node diskless server farm with all nodes booting from a back-end network running both iSCSI and Fibre Channel SAN. This architecture packs massive performance into a footprint of just over 235 square feet and offers many unique advantages:
For details, see the BayLISA presentation The Kilo-Client Host Swarm. |
|