Tech On Tap :: Insights for Simplifying Data Management | NetApp(R) - Network Appliance
To Tech on Tap Home Click to visit NetApp
TECH ONTAP ARCHIVE - MARCH 2007 (PDF)
Stephen Manley
Tim Dietrich
Systems Architect, Agilent Technologies
Tim has been working in IT for over 17 years, with 13 of those years spent at Agilent functioning as an internal IT consultant. Tim was responsible for driving the project described in this article, including evaluating potential solutions, serving as advocate and IT architect, and managing all phases of implementation and rollout.
A Mind Shift in Data Protection
Agilent Moves to Disk-to-Disk Backup
By Tim Dietrich

Agilent Technologies is the world's premier measurement company and a technology leader in communications, electronics, life sciences, and chemical analysis. A spin-off of Hewlett Packard in 1999, Agilent has 19,000 employees, serves customers in more than 110 countries, and had revenue of USD$5.4B in fiscal year 2007.

Mountains of Data

Backup Challenges

One of our key research and development divisions is based in America in Southern California. This division has about 30TB of storage and about 200 computer-savvy engineers.

In 2005, the backup environment for this Southern California site consisted of predominantly network-based backup to tape. We had about 300 systems on the network being backed up to centralized tape libraries. To meet our SLAs, we were keeping copies of the backups locally so we could do quick file restores, and we were also duplicating tapes for off-site storage and disaster recovery. The whole environment was getting to be very challenging to support. Our key challenges included:

  • Long backup windows–approaching or exceeding 24 hours
  • Backups were dragging down system performance
  • Complicated management
  • Low reliability
  • No remote office support
  • Labor-intensive tape duplication process
  • Increasing costs

Tape just wasn't scaling for us. We were seeing huge data growth year over year, and continuing to address that problem with backup to tape was no longer getting the job done.

Storage Infrastructure Overhaul

We were already pursuing the idea of centralizing the business data that was spread across our distributed systems down to a few storage devices in order to make it easier for the R&D staff to manage their data and to improve our storage utilization and data availability. However, the real clincher on this strategy came when we spoke to NetApp and learned about SnapMirror® and SnapVault technology available with NetApp storage systems. We knew we had found a backup solution that offered the sea change we needed to overcome our existing backup limitations.

We wound up deploying a number of frontline NetApp storage systems–FAS900 and FAS3000 series systems–and centralizing all of our data on them for primary storage. We also deployed a NetApp R200 as a second tier. That allows us to use our FAS storage for what it is intended–high-performance storage–and use less expensive storage for backups.

Client systems primarily access the centralized storage using NFS and CIFS, which provides our engineers the most flexibility in reaching their data from anywhere on the network. We had also been running Fibre Channel SAN in the past for a few servers that required greater I/O performance, but found SAN to be terribly complex to support and expensive to maintain. For the situations where we still needed SAN levels of performance, we started relying on iSCSI, which we´ve found to be very fast, reliable, and much more cost effective.

We adopted SnapVault for backing up our primary storage systems to our secondary storage; we back up that secondary storage to tape using NDMP for disaster recovery purposes. Because we only do tape backups once a week now, as opposed to backing up to tape every day, we have the luxury of taking as much time as we need to complete those tape backups and have dropped from a couple dozen tape drives down to four.

Mountains of Data

Figure 1: Agilent Network Architecture

For those few systems where for whatever reason we couldn't use centralized storage, we adopted Open Systems SnapVault (OSSV) to allow those legacy systems to utilize the new backup paradigm. We also use SnapVault to back up data from remote office storage systems in Santa Clara and Atlanta to the R200 in our main location. We typically run those backups on the weekend, and just keep enough local Snapshot™ copies on the remote storage systems to allow end users to do any necessary file recoveries during the week.

SnapVault Impact

The biggest benefit for Agilent is that SnapVault has significantly cut our backup times. In some cases, we went from 24-hour time spans down to minutes. The whole process of replicating data from the primary storage systems is pretty darn fast. Plus, it's nowhere near as system intensive as the old backup to tape, where you always had to walk through the whole file system, creating a big performance drag. SnapVault knows what blocks have to move in advance and just gets it done. By eliminating the backup burden from our servers and primary storage, we're able to truly use those systems to meet the business needs that we bought them for in the first place. 

Here´s a list of what I see as the key benefits:
  • Backup times cut from many hours to minutes
  • Eliminated burden on primary storage and servers
  • End users can do their own file restores without assistance from IT, which improves their productivity and ours
  • Much greater reliability
  • Cost savings as a result of ease of management

If we had tried to scale our old backup architecture to meet the needs of all three sites, we would have had to invest well over $400k in tape hardware and software in order to meet our backup SLAs, and we still would have been struggling with the same old labor-intensive support issues that had always caused us so much pain. By contrast, the SnapVault-based backup solution cost us less than $300k to deploy, and reduced our support burden by about 500 person hours a year over the old solution.

Overall, it's just been a much happier experience dealing with this new backup environment. We have a lot more confidence, and the user population is a lot happier. The IT team is a lot happier. We're saving money on the deal. So I think it's just been a real win all the way around.

Want to Learn More about SnapVault?

In a recent Webcast, the Tim Dietrich joined a team of NetApp data protection specialists, including NetApp data protection architect Stephen Manley and services engineering director Kelvin Mayo to drill down on SnapVault technologies, elaborate further on the Agilent implementation, and detail deployment best practices. Watch the Webcast!


 

RELATED INFORMATION

A Quick Primer on NetApp Data Protection Software

NetApp customers have two potential alternatives for data protection: SnapMirror® or SnapVault software.

SnapMirror is replication software intended for disaster recovery solutions. The mirror is an exact replica of data on the primary storage including all the local snapshot copies and can be mounted read/write to recover from failure. If a snapshot backup is deleted on the source, it will go away on the mirror at the next replication.

SnapVault, in contrast, is intended for disk-to-disk backup. A separate snapshot retention policy is specified for the target environment allowing long term archival of snapshot backups on secondary storage. Secondary storage used by SnapVault cannot be mounted read/write. Backups must be recovered from secondary storage to the original or an alternative primary storage system in order to restart.

At a more technical level, SnapVault takes a point-in-time image based on qtrees, while SnapMirror offers both qtree-based replication or replication of a full volume.

Get the details. Read the reports:


Information-Optimized Backups

It used to be that a larger library, faster tape drive, upgraded network, or new backup server kept backups running.

Over the past five years, however, disk capacities have grown 20x, with a scant fourfold disk speed performance improvement. This has made it more time consuming moving entire copies of data off of disk during a full backup.. Today, many companies are turning to smarter backup solutions that:

  • Shrink backup windows with snapshot backups and only replicating incremental data changes

  • Reduce backup storage consumption by eliminating redundancy within backup files

  • Ensure reliable, recoverable backups by validating backups as they are created

Read When Brute Force Isn't Enough: The Value of Information for Backups


Protection Manager is an intuitive policy based management application for NetApp disk-based backup and replication technologies including SnapMirror, SnapVault, and Open Systems SnapVault.

This tool allows administrators to apply predefined policies to their data, thereby eliminating ambiguity and the potential for error inherent in manual management.

A simple dashboard depicts comprehensive data protection information at a glance, including unprotected data, longest lag times, etc. The application automatically correlates the datasets and underlying physical storage resources, so administrators don´t need to think in terms of the storage infrastructure.

Read the NEW Protection Manager Technical Report. (pdf)

Innovative Disk-to-Disk Backup Solutions.

SUBSCRIBE | PROVIDE FEEDBACK