|
|
Enterprise Backup and Beyond: State of the Art Backup Software
From Backup Central
- Redefine backup to include dedupe, CDP, near-CDP (new in this version)
- Common tasks for all backup software
- Prepare data to be backed up
- Read data to be backed up
- Send data across a network
- Accept data from network
- Store data on backup device
- Increase backup speed
- Decrease backup load (duration, effect on servers/networks)
- Record where backup is
- Create additional copies
- Protect the backup history
- Prepare backup for theft in transit
- Send backup to another location
- Collocating data for potential restore
- Expire old backups
- Make old backup media available
- Migrate data between storage devices
- Request data for restore
- Prepare media
- Read data from backup device
- Transfer across network
- Write to disk
- Traditional backup software
- Prepare data to be backed up
- Typical filesystem
- Millions of files (image backups)
- User scripts
- Snapshots
- BCVs
- Applications
- Read data to be backed up
- First full backup
- File-level Incremental backups
- File-level Cumulative Incremental/Differential backups
- Block-level Incremental backups
- Applications
- Send data across a network
- WAN
- LAN
- iSCSI
- Fibre Channel
- Accept data from network
- WAN
- LAN
- iSCSI
- Fibre Channel
- Store data on backup device
- Backup formats
- The dump utility
- The tar, ditto, and cpio utilities
- Commercial backup formats
- Mainframe method (ind. files & aggregates used by TSM)
- Traditional backup to tape
- Traditional backup to local filesystem
- Traditional backup to NAS filesystem
- Traditional backup to VTL
- Increasing backup speed
- Multiplexing
- Multistreaming
- LAN-free data transfer
- Decrease backup load (duration, effect on servers/networks)
- Progressive incremental with no full
- Traditional incremental with virtual full backup
- Record where backup is
- Different kinds of databases
- Images, savesets and files
- Individual version tracking
- Create additional copies
- Why you should make copies
- Traditional device-to-device copy
- Cover disk to tape, tape to disk, small tapes to big tape, big tape to small tapes
- Backup mirroring (ITC)
- Cover mirroring to disk and tape
- Intelligent device
- Traditional tape out
- Third party copy controlled by backup software (e.g. NBU NDMP Direct Tape, open storage api)
- Protect the backup history
- Catalog backups
- Replication of catalog
- Send backup to another location
- Traditional tape and truck
- Encryption (overview, covered in detail elsewhere)
- Originals
- Copies
- IDT methods
- Usually not encrypted
- IDT cascading (no backup knowledge), usually eventually to tape
- IDT cascading (with backup control & knowledge), usually eventually to tape
- Collocating data for potential restore
- Grandfather/father/son with fulls/diffs or virtual fulls/diffs
- Progressive incremental
- Collocation (client, filespace, group, active collocation)
- Reclamation
- Expire old backups
- Expire image/saveset, all files in image expire
- Expire individual file entries
- Make old backup media available
- Fully expired tapes recycled into list of available tapes
- Special considerations when using VTLs (how space on tapes is reclaimed)
- Backup sent to filesystem deleted
- Reclamation of mostly expired tapes
- Migrate data between storage devices
- Disk to tape migration
- Tape to tape migration
- Request data for restore
- One request system
- Multi-request system (what arcserve does)
- Application restore request
- Bring media onsite if necessary
- Contact offsite storage vendor
- Request move of tape to onsite
- Decide between tomorrow, rush, super rush
- Prepare media
- Restore from tape
- May need to recall (should've made copies)
- Put into tape library
- Software mounts, addresses & reads
- Restore from IDT
- If in onsite IDT
- If only in offsite IDT
- Restores vs backups (suspension, prioritization over)
- Read data from backup device(s)
- Legacy style of complete full restore followed by complete inc. restore
- Typical restore that only restores necessary files (like NBU)
- Simultaneous restore from multiple tapes
- Simultaneous restores from one tape
- Single restore from multiplexed image (how it is often gated by other components)
- Transfer across network
- From backup server
- From local server
- Write to disk
- File-level
- Block-level
- De-dupe backup software
- Overview
- Including delta diff & hashing
- What its for (not for)
- Multi-tier system (remote only, local recovery server)
- Software as service, migrate to own hardware
- Prepare data to be backed up
- Identify changed files via mtime, utime, archive bit
- Identifying changed data in a database file
- Identify changed blocks in a changed file
- Delta differential method
- Hashing method
- Identify unique, changed blocks (hash dedupe)
- Read data to be backed up
- Read new, unique blocks
- Send data across a network
- All send via IP
- Accept data from network
- All receive via IP
- Store data on backup device
- De-dupe sw requires raw disk
- Increase backup speed
- Main decrease comes from reducing data transferred
- Biggest challenge is that it's very compute intensive
- Decrease backup load (duration, effect on servers/networks)
- Main decrease comes from reducing data transferred
- Biggest challenge is that it's very compute intensive
- Record where backup is
- Hash table for hash products
- Not sure how delta products store their data
- Create additional copies
- Done via replication into another unit
- Protect the backup history
- Really need to ask about how they protect the "catalog"
- Send backup to another location
- Replication
- Collocating data for potential restore
- Already collocated and ready for a restore
- Can collocate data with servers (local recovery server)
- Expire old backups
- Expiring a file reduces by one count the number of links to a given chunk
- When a given chunk is reduced to zero links, the chunk can be expired
- Need to explain how delta works.
- Make old backup media available
- Consists of deleting chunks with zero links & hash entries
- What about fragmentation?
- Migrate data between storage devices
- No need, although may move around for load balancing
- Request data for restore
- Similar to traditional backup. choose file/directory and time, or app interface
- Bring media onsite if necessary
- Most backups should be onsite already
- Discuss local recovery server
- Prepare media
- Files are built dynamically from pointers to blocks
- Read data from backup device
- Data is built on the fly
- Transfer across network
- All current systems transfer via IP
- Write to disk
- CDP
- Overview - explain true CDP
- Prepare data to be backed up
- Since we will be transferring everything, not really necessary
- Some CDP products will allow you to create a time marker (backup mode etc)
- Read data to be backed up
- Initial full backup - may take a while
- Continuous incremental -- as a byte changes, it is transferred to CDP server
- Explain difference between software data tap and hardware appliance
- Send data across a network
- WAN/LAN
- Usually software data tap, data is sent via IP to CDP server
- iSCSI/Fibre Channel
- Appliance-based systems can send via FC
- Some CDP products that read from appliance can send data via SCSI
- Accept data from network
- Same as above
- Store data on backup device
- Constant journal, no standby disk
- Standby disk with journal regularly applied
- Talk about what happens when it gets behind ("marketing only" mode)
- Increase backup speed
- Can't get any faster -- it's continuous
- Decrease backup load (duration, effect on servers/networks)
- Actually runs all the time, but doesn't impact app so ok
- Record where backup is
- Journal must be continually updated with current state and all blocks that got us here
- Collocating data for potential restore
- All data is located in a single array
- Create additional copies
- Replication
- Protect the backup history
- Protecting the journal -- how?
- Send backup to another location
- Replication
- Expire old backups
- Expire old journal entries and associated blocks
- Make old backup media available
- As old blocks are cleared, they're available for new blocks
- Migrate data between storage devices
- With constant journal method, nothing
- With full standby disk, regular transfer of new blocks
- Request data for restore
- Pick a point in time and a LUN to restore
- My pick signficant point in time as determined by app
- Bring media onsite if necessary
- Local recovery server
- Prepare media
- Create virtual LUN on fly - quick, but may be slower performance
- Roll standby LUN to appropriate point in time - takes longer, but better performance
- If you want, you can mount LUN in another place to check that it's cool
- Read data from backup device
- If using as standby, can point production app right at LUN
- If need to restore production LUN, can read only blocks that have changed since PIT
- Transfer across network
- If using as standby, blocks will transfer as requested
- Must transfer only those blocks that have changed
- Write to disk
- If using as standby, N/A
- If doing restore, can write only changed blocks (incremental restore)
- Special
- Explain how you can do both standby & restore, then switch
- Near-CDP
- Types of snapshots
- We're talking about virtual copies, not BCVs
- Different methods: copy on write, redirect on write, WAFL
- User accessible or not?
- Prepare data to be backed up
- If user data, no prep needed
- If app data, put app in backup mode or down
- Explain snapshot then replicate, replicate then snapshot
- Replicate then snapshot only good for user data
- Scheduled vs user script driven vs backup app driven
- Take snapshot
- Read data to be backed up
- Backup is now done, but must create another copy
- Going to read static images of blocks to send via network
- Send data across a network
- Typically IP/WAN/LAN
- Doesn't preclude transfer via FC/iSCSI
- Accept data from network
- Typically IP/WAN/LAN
- Doesn't preclude transfer via FC/iSCSI
- Store data on backup device
- On source system, must store blocks necessary to continue to present snapshot
- Explain how many blocks that may be
- Store blocks necessary to replicate all snapshots
- Increase backup speed
- Virtual backup instantaneous
- Simultaneous copies can increase repl. speed
- Decrease backup load (duration, effect on servers/networks)
- Same
- Record where backup is
- Self explanator format
- Some can work with MS VSS API
- Some backup apps can record content of snapshot in catalog
- Collocate data for restore
- It's all already there
- Create additional copies
- Replication
- Protect the backup history
- Self-contained
- Backup history can actually be rebuilt
- Send backup to another location
- Replication
- Migrate data between storage devices
- No need
- Collocating data for potential restore
- Already there
- Expire old backups
- Snapshots expired = return of space
- Some snaphots allow you to determine expiry time when created, then auto expired
- Can usually manually expire images out of order
- Backup app controlled can be expired by backup app
- Make old backup media available
- As snapshots expire, blocks used by them returned to storage
- Request data for restore
- Traditional way, just go the right snapshot directory
- Some can work with VSS "previous versions"
- If recorded in backup app, can do usual browse restore
- Bring media onsite if necessary
- Usually onsite already
- Prepare media
- No prep needed
- Read data from backup device
- Drag and drop files needed
- Previous versions VSS api will overwrite file with selected version
- Backup app will select previous version and tell snapshot software to restore it
- Transfer across network
- If restoring from replicated copy, may need to copy across network
- If restoring from same system, no need
- Write to disk
- Typically complete overwrite for file-level
- Some can do incremental restore if restoring entire volume
- Other concerns
- Protection of the backup index
- Most important backup you have
- BMR
- Explain what were not covering (free stuff)
- Explain commercial ways of handling
- Cloning/Ghosting
- Special BMR backup of everything
- Regular backup accompanied by BMR info
- ROBO backup
- Talk about evils of typical, tell stories of security guard swapping tapes
- Time to get rid of tape
- Use dedupe software or hardware
- Use regular software & dedupe hardware
- Use dedupe software
- Software as a service
- Network-Mounted Filesystems (just borrow from other book)
- Backup via NFS
- Local agent
- NDMP
- Databases
- Agent vs scripts
<<Go back to Main Outline
|
|