Out-of-band data movement is not without its challenges. First, depending on the solution you choose and your environment, it can be very time consuming to crawl the entire environment, especially if there are millions of files to examine. Some of the solutions keep a list of previously scanned files and don't need to examine them again if nothing has changed since the last file system scan. Regardless, be prepared for scan times of hours and maybe days in some cases, especially on the first scan. As a result, these scans are scheduled events that need to be integrated into all the other tasks that you have going on during "idle" times. When a file qualifies for a certain type of action -- move an old file, for example -- that action does not occur until the next time the out-of-band data mover performs its scan.
The final area of concern is how the out-of-band data movers maps its way back to files that it has moved. It does this by leaving a stub file behind. A stub file is a small version of the file that points to where the new location of the original file is. Since they don't have a local agent, out-of-band data movers need to leverage what the operating system provides to be able to create this link. For Windows files, they will leave a shortcut, for Unix systems a symbolic link. These files need to be managed and accounted for. Your backup application has to be set to ignore them and your users taught not to delete them.
Out-of-band data movers provide a sense of simplicity to the data management task. In a future entry, we will compare global file systems, agent-based data movers, and out-of-band data movers to give some guidance on selecting the right technique for your data center.
George Crump is founder of Storage Switzerland, an analyst firm focused on the virtualization and storage marketplaces. It provides strategic consulting and analysis to storage users, suppliers, and integrators. An industry veteran of more than 25 years, Crump has held engineering and sales positions at various IT industry manufacturers and integrators. Prior to Storage Switzerland, he was CTO at one of the nation's largest integrators.