This article lists data recovery and undeletion options for Linux.
Before you start
This page is mostly intended to be used for educational purposes. If you have accidentally deleted or otherwise damaged your valuable and irreplaceable data and have no previous experience with data recovery, turn off your computer immediately (Just press and hold the off button or pull the plug; do not use the system shutdown function) and seek professional help. It is quite possible and even probable that, if you follow any of the steps described below without fully understanding them, you will worsen your situation.
In the area of data recovery, it is best to work on images of disks rather than physical disks themselves. Generally, a failing drive's condition worsens over time. The goal ought to be to first rescue as much data as possible as early as possible in the failure of the disk and to then abandon the disk. The
dd, will repeatedly try to recover from errors and will read the drive front to back, then back to front, attempting to salvage data. They keep log files so that recovery can be paused and resumed without losing progress.
See Disk cloning.
The image files created from a utility like ddrescue can then be mounted like a physical device and can be worked on safely. Always make a copy of the original image so that you can revert if things go sour!
A tried and true method of improving failing drive reads is to keep the drive cold. A bit of time in the freezer is appropriate, but be careful to avoid bringing the drive from cold to warm too quickly, as condensation will form. Keeping the drive in the freezer with cables connected to the recovering PC works great.
Do not attempt a filesystem check on a failing drive, as this will likely make the problem worse. Mount it read-only.
Backup flash media/small partitions
As an alternative to working with a 'live' partition (mounted or not), it is often preferable to work with an image, provided that the filesystem in question is not too large and that you have sufficient free HDD space to accommodate the image file. For example, flash memory devices like thumb drives, digital cameras, portable music players, cellular phones, etc. are likely to be small enough to image in many cases.
Be sure to read the man pages for the utilities listed below to verify that they are capable of working with image files.
To make an image, one can use
dd as follows:
# dd if=/dev/target_partition of=/home/user/partition.image
Working with digital cameras
In order for some of the utilities listed in the next section to work with flash media, the device in question needs to be mounted as a block device (i.e., listed under /dev). Digital cameras operating in PTP (Picture Transfer Protocol) mode will not work in this regard. PTP cameras are transparently handled by libgphoto and/or libptp. In this case, "transparently" means that PTP devices do not get block devices. The alternative to PTP mode, USB Mass Storage (UMS) mode, is not supported by all cameras. Some cameras have a menu item that allows switching between the two modes; refer to your camera's user manual. If your camera does not support UMS mode and therefore cannot be accessed as a block device, your only alternative is to use a flash media reader and physically remove the storage media from your camera.
List of utilities
- dvdisaster — Additional error protection for CD/DVD media.
- ext4magic — recover deleted or overwritten files on ext3 and ext4 filesystems.
- Foremost — Console program to recover files based on their headers, footers, and internal data structures. This process is commonly referred to as data carving. The headers and footers can be specified by a configuration file or command line switches can be used to specify built-in file types.
- PhotoRec — File data recovery software designed to recover lost files including video, documents and archives from hard disks, CD-ROMs, and lost pictures (thus the Photo Recovery name) from digital camera memory.
- Scalpel — File carving and indexing application originally based on Foremost, although significantly more efficient. It allows an examiner to specify a number of headers and footers to recover filetypes from a piece of media.
- TestDisk — Data recovery software primarily designed to help recover lost partitions and/or make non-booting disks bootable again when these symptoms are caused by faulty software: certain types of viruses or human error (such as accidentally deleting a Partition Table).
is another recovery tool for the ext3 and ext4 file system.
To recover all files, deleted in the last 24 hours:
# ext4magic /dev/sdXY -r
To recover a directory or file:
# ext4magic /dev/sdXY -f path/to/lost/file -r
The small R flag
-r will only recover complete files, that were not overwritten.
To also recover broken files, that were partially overwritten, use the big R flag
This will also restore not-deleted files and empty directories.
The default destination is
which can be changed by adding the option
If a file exists in the destination directory,
the new file is renamed with a trailing hash sign
To recover files deleted after 'five days ago':
# ext4magic /dev/sdXY -f path/to/lost/file -a $(date -d -5days +%s) -r
To use a file list:
# ext4magic /dev/sdXY -f path/to/lost/file -Lx | grep -a ^--- >recovery-files-big.txt # ext4magic /dev/sdXY -i recovery-files-big.txt -R # ext4magic /dev/sdXY -f path/to/lost/file -lx | grep -a '^ 100%' >recovery-files-small.txt # ext4magic /dev/sdXY -i recovery-files-small.txt -r
The difference between the big L flag
-L and the small L flag
is the same as between the two R flags
-r (see above).
grep -a to preserve binary file names.
Using a file list allows to filter the files, for example by file extension:
# cat recovery-files-big.txt | grep -a '\.jpg"$' >recovery-files-big-jpg.txt
... or to split the file list:
# cat recovery-files-big.txt | split -l 100 - recovery-files-big-100-each-
Testdisk and PhotoRec
TestDisk and Photorec are both open-source data recovery utilities licensed under the terms of the GNU Public License (GPL).
TestDisk is primarily designed to help recover lost partitions and/or make non-booting disks bootable again when these symptoms are caused by faulty software, certain types of viruses, or human error, such as the accidental deletion of partition tables.
PhotoRec is file recovery software designed to recover lost files including photographs (Hint: PhotographRecovery), videos, documents, archives from hard disks and CD-ROMs. PhotoRec ignores the filesystem and goes after the underlying data, so it will still work even with a re-formatted or severely damaged filesystems and/or partition tables.
Install the package, which provides both TestDisk and PhotoRec.
After running e.g.
photorec image.img will open a terminal UI where you can select what file types to search for and where to put the recovered files.
Files recovered by photorec
The photorec utility stores recovered files with a random names(for most of the files) under a numbered directories, e.g.
- How to get the original filenames: PhotoRec FAQ
- Wiki (TestDisk): http://www.cgsecurity.org/wiki/TestDisk
- Wiki (Photorec): http://www.cgsecurity.org/wiki/PhotoRec
- Homepage: http://www.cgsecurity.org/
e2fsck is the ext2/ext3 filesystem checker included in the base install of Arch. e2fsck relies on a valid superblock. A superblock is a description of the entire filesystem's parameters. Because this data is so important, several copies of the superblock are distributed throughout the partition. With the
-b option, e2fsck can take an alternate superblock argument; this is useful if the main, first superblock is damaged.
To determine where the superblocks are, run
dumpe2fs -h on the target, unmounted partition. Superblocks are spaced differently depending on the filesystem's blocksize, which is set when the filesystem is created.
An alternate method to determine the locations of superblocks is to use the -n option with mke2fs. Be sure to use the
-n flag, which, according to the
mke2fs manpage, "Causes mke2fs to not actually create a filesystem, but display what it would do if it were to create a filesystem. This can be used to determine the location of the backup superblocks for a particular filesystem, so long as the mke2fs parameters that were passed when the filesystem was originally created are used again. (With the -n option added, of course!)".
dumpe2fs are included in the base Arch install as part of .
See alsoand .
Working with raw disk images
If you have backed up a drive using ddrescue or dd and you need to mount this image as a physical drive, see this section.
Mount the entire disk
To mount a complete disk image to the next free loop device, use the
# losetup -f -P /path/to/image
-fflag mounts the image to the next available loop device.
-Pflag creates additional devices for every partition.
In order to be able to mount a partiton of a whole disk image, follow the steps above.
Once the whole disk image is mounted, a normal
mount command can be used on the loop device:
# mount /dev/loop0p1 /mnt/example
This command mounts the first partition of the image in loop0 to the folder to the mountpoint
/mnt/example. Remember that the mountpoint directory must exist!
Getting disk geometry
Once the entire disk image has been mounted as a loopback device, its drive layout can be inspected.
Using QEMU to repair NTFS
With a disk image that contains one or more NTFS partitions that need to be
chkdsked by Windows since no good NTFS filesystem checker for Linux exists, QEMU can use a raw disk image as a real hard disk inside a virtual machine:
# qemu -hda /path/to/primary.img -hdb /path/to/DamagedDisk.img
Then, assuming Windows is installed on
primary.img, it can be used to check partitions on
Text file recovery
It is possible to find deleted plain text files on a hard drive by directly searching on the block device. A preferably unique string from the file you are trying to recover is needed.
grep to search for fixed strings (
-F) directly on the partition:
$ grep -a -C 200 -F 'Unique string in text file' /dev/sdXN > OutputFile
Hopefully, the content of the deleted file is now in OutputFile, which can be extracted from the surrounding context manually.
-C 200option tells grep to print 200 lines of context from before and after each match of the string. Alternatives are the
-Bflags, which print context only from after and before each match, respectively. You may need to adjust the number of lines if the file you are looking for is very long.
- Data Recovery on the Ubuntu wiki