Inicio
Productos
Red Hat Enterprise Linux
7
Storage Administration Guide
31.3. Data Efficiency Testing Procedures

Este contenido no está disponible en el idioma seleccionado.

31.3. Data Efficiency Testing Procedures

Successful validation of VDO is dependent upon following a well-structured test procedure. This section provides a series of steps to follow, along with the expected results, as examples of tests to consider when participating in an evaluation.

Test Environment

The test cases in the next section make the following assumptions about the test environment:

One or more Linux physical block devices are available.
The target block device (for example, /dev/sdb) is larger than 512 GB.
Flexible I/O Tester (fio) version 2.1.1 or later is installed.
VDO is installed.

The following information should be recorded at the start of each test in order to ensure that the test environment is fully understood:

The Linux build used, including the kernel build number.
A complete list of installed packages, as obtained from the rpm -qa command.
Complete system specifications:
- CPU type and quantity (available in /proc/cpuinfo).
- Installed memory and the amount available after the base OS is running (available in /proc/meminfo).
- Type(s) of drive controller(s) used.
- Type(s) and quantity of disk(s) used.
A complete list of running processes (from ps aux or a similar listing).
Name of the Physical Volume and the Volume Group created for use with VDO (pvs and vgs listings).
File system used when formatting the VDO volume (if any).
Permissions on the mounted directory.
Contents of /etc/vdoconf.yaml.
Location of the VDO files.

You can capture much of the required information by running sosreport.

Workloads

Effectively testing VDO requires the use of data sets that simulate real world workloads. The data sets should provide a balance between data that can be deduplicated and/or compressed and data that cannot in order to demonstrate performance under different conditions.

There are several tools that can synthetically generate data with repeatable characteristics. Two utilities in particular, VDbench and fio, are recommended for use during testing.

This guide uses fio. Understanding the arguments is critical to a successful evaluation:

Expand

Table 31.1. fio Options
Argument	Description	Value
`--size`	The quantity of data fio will send to the target per job (see `numjobs` below).	100 GB
`--bs`	The block size of each read/write request produced by fio. Red Hat recommends a 4 KB block size to match VDO's 4 KB default	4k
`--numjobs`	The number of jobs that fio will create to run the benchmark. Each job sends the amount of data specified by the `--size` parameter. The first job sends data to the device at the offset specified by the `--offset` parameter. Subsequent jobs write the same region of the disk (overwriting) unless the `--offset_increment` parameter is provided, which will offset each job from where the previous job began by that value. To achieve peak performance on flash at least two jobs are recommended. One job is typically enough to saturate rotational disk (HDD) throughput.	1 (HDD) 2 (SSD)
`--thread`	Instructs fio jobs to be run in threads rather than being forked, which may provide better performance by limiting context switching.	<N/A>
`--ioengine`	There are several I/O engines available in Linux that are able to be tested using fio. Red Hat testing uses the asynchronous unbuffered engine (`libaio`). If you are interested in another engine, discuss that with your Red Hat Sales Engineer. The Linux `libaio` engine is used to evaluate workloads in which one or more processes are making random requests simultaneously. `libaio` allows multiple requests to be made asynchronously from a single thread before any data has been retrieved, which limits the number of context switches that would be required if the requests were provided by manythreads via a synchronous engine.	libaio
`--direct`	When set, direct allows requests to be submitted to the device bypassing the Linux Kernel's page cache. Libaio Engine: `libaio` must be used with `direct` enabled (=1) or the kernel may resort to the `sync` API for all I/O requests.	1 (libaio)
`--iodepth`	The number of I/O buffers in flight at any time. A high `iodepth` will usually increase performance, particularly for random reads or writes. High depths ensure that the controller always has requests to batch. However, setting `iodepth` too high (greater than 1K, typically) may cause undesirable latency. While Red Hat recommends an `iodepth` between 128 and 512, the final value is a trade-off and depends on how your application tolerates latency.	128 (minimum)
`--iodepth_batch_submit`	The number of I/Os to create when the `iodepth` buffer pool begins to empty. This parameter limits task switching from I/O to buffer creation during the test.	16
`--iodepth_batch_complete`	The number of I/Os to complete before submitting a batch (`iodepth_batch_complete`). This parameter limits task switching from I/O to buffer creation during the test.	16
`--gtod_reduce`	Disables time-of-day calls to calculate latency. This setting will lower throughput if enabled (=0), so it should be enabled (=1) unless latency measurement is necessary.	1

31.3.1. Configuring a VDO Test Volume
Copiar enlace

1. Create a VDO Volume with a Logical Size of 1 TB on a 512 GB Physical Volume
Copiar enlace

Create a VDO volume.

To test the VDO async mode on top of synchronous storage, create an asynchronous volume using the --writePolicy=async option:

vdo create --name=vdo0 --device=/dev/sdb \
             --vdoLogicalSize=1T --writePolicy=async --verbose

# vdo create --name=vdo0 --device=/dev/sdb \
             --vdoLogicalSize=1T --writePolicy=async --verbose

Copy to Clipboard

Toggle word wrap

To test the VDO sync mode on top of synchronous storage, create a synchronous volume using the --writePolicy=sync option:

vdo create --name=vdo0 --device=/dev/sdb \
             --vdoLogicalSize=1T --writePolicy=sync --verbose

# vdo create --name=vdo0 --device=/dev/sdb \
             --vdoLogicalSize=1T --writePolicy=sync --verbose

Copy to Clipboard

Toggle word wrap

Format the new device with an XFS or ext4 file system.
- For XFS:
  # mkfs.xfs -K /dev/mapper/vdo0
  Copy to Clipboard Toggle word wrap
- For ext4:
  # mkfs.ext4 -E nodiscard /dev/mapper/vdo0
  Copy to Clipboard Toggle word wrap

Mount the formatted device:

mkdir /mnt/VDOVolume
mount /dev/mapper/vdo0 /mnt/VDOVolume && \
  chmod a+rwx /mnt/VDOVolume

# mkdir /mnt/VDOVolume
# mount /dev/mapper/vdo0 /mnt/VDOVolume && \
  chmod a+rwx /mnt/VDOVolume

Copy to Clipboard

Toggle word wrap

31.3.2. Testing VDO Efficiency
Copiar enlace

2. Test Reading and Writing to the VDO Volume
Copiar enlace

Write 32 GB of random data to the VDO volume:

dd if=/dev/urandom of=/mnt/VDOVolume/testfile bs=4096 count=8388608

$ dd if=/dev/urandom of=/mnt/VDOVolume/testfile bs=4096 count=8388608

Copy to Clipboard

Toggle word wrap

Read the data from the VDO volume and write it to another location not on the VDO volume:
```
dd if=/mnt/VDOVolume/testfile of=/home/user/testfile bs=4096
```
```
$ dd if=/mnt/VDOVolume/testfile of=/home/user/testfile bs=4096
```
Copy to Clipboard Toggle word wrap
Compare the two files using diff, which should report that the files are the same:
```
diff -s /mnt/VDOVolume/testfile /home/user/testfile
```
```
$ diff -s /mnt/VDOVolume/testfile /home/user/testfile
```
Copy to Clipboard Toggle word wrap

Copy the file to a second location on the VDO volume:

dd if=/home/user/testfile of=/mnt/VDOVolume/testfile2 bs=4096

$ dd if=/home/user/testfile of=/mnt/VDOVolume/testfile2 bs=4096

Copy to Clipboard

Toggle word wrap

Compare the third file to the second file. This should report that the files are the same:
```
diff -s /mnt/VDOVolume/testfile2 /home/user/testfile
```
```
$ diff -s /mnt/VDOVolume/testfile2 /home/user/testfile
```
Copy to Clipboard Toggle word wrap

3. Remove the VDO Volume
Copiar enlace

Unmount the file system created on the VDO volume:
```
umount /mnt/VDOVolume
```
```
# umount /mnt/VDOVolume
```
Copy to Clipboard Toggle word wrap
Run the command to remove the VDO volume vdo0 from the system:
```
vdo remove --name=vdo0
```
```
# vdo remove --name=vdo0
```
Copy to Clipboard Toggle word wrap
Verify that the volume has been removed. There should be no listing in vdo list for the VDO partition:
```
vdo list --all | grep vdo
```
```
# vdo list --all | grep vdo
```
Copy to Clipboard Toggle word wrap

4. Measure Deduplication
Copiar enlace

Create and mount a VDO volume following Section 31.3.1, “Configuring a VDO Test Volume”.
Create 10 directories on the VDO volume named vdo1 through vdo10 to hold 10 copies of the test data set:
```
mkdir /mnt/VDOVolume/vdo{01..10}
```
```
$ mkdir /mnt/VDOVolume/vdo{01..10}
```
Copy to Clipboard Toggle word wrap
Examine the amount of disk space used according to the file system:
```
df -h /mnt/VDOVolume

Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vdo0      1.5T  198M  1.4T   1% /mnt/VDOVolume
```
```
$ df -h /mnt/VDOVolume

Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vdo0      1.5T  198M  1.4T   1% /mnt/VDOVolume
```
Copy to Clipboard Toggle word wrap
Consider tabulating the results in a table:
Expand
Statistic Bare File System After Seed After 10 Copies
File System Used Size 198 MB
VDO Data Used
VDO Logical Used
Run the following command and record the values. "Data blocks used" is the number of blocks used by user data on the physical device running under VDO. "Logical blocks used" is the number of blocks used before optimization. It will be used as the starting point for measurements
```
vdostats --verbose | grep "blocks used"

data blocks used                : 1090
overhead blocks used            : 538846
logical blocks used             : 6059434
```
```
# vdostats --verbose | grep "blocks used"

data blocks used                : 1090
overhead blocks used            : 538846
logical blocks used             : 6059434
```
Copy to Clipboard Toggle word wrap

Create a data source file in the top level of the VDO volume

dd if=/dev/urandom of=/mnt/VDOVolume/sourcefile bs=4096 count=1048576

4294967296 bytes (4.3 GB) copied, 540.538 s, 7.9 MB/s

$ dd if=/dev/urandom of=/mnt/VDOVolume/sourcefile bs=4096 count=1048576

4294967296 bytes (4.3 GB) copied, 540.538 s, 7.9 MB/s

Copy to Clipboard

Toggle word wrap

Re-examine the amount of used physical disk space in use. This should show an increase in the number of blocks used corresponding to the file just written:

df -h /mnt/VDOVolume

Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vdo0      1.5T  4.2G  1.4T   1% /mnt/VDOVolume

$ df -h /mnt/VDOVolume

Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vdo0      1.5T  4.2G  1.4T   1% /mnt/VDOVolume

Copy to Clipboard

Toggle word wrap

vdostats --verbose | grep "blocks used"

data blocks used                : 1050093 (increased by 4GB)
overhead blocks used            : 538846 (Did not change)
logical blocks used             : 7108036 (increased by 4GB)

# vdostats --verbose | grep "blocks used"

data blocks used                : 1050093 (increased by 4GB)
overhead blocks used            : 538846 (Did not change)
logical blocks used             : 7108036 (increased by 4GB)

Copy to Clipboard

Toggle word wrap

Copy the file to each of the 10 subdirectories:

for i in {01..10}; do
  cp /mnt/VDOVolume/sourcefile /mnt/VDOVolume/vdo$i
  done

$ for i in {01..10}; do
  cp /mnt/VDOVolume/sourcefile /mnt/VDOVolume/vdo$i
  done

Copy to Clipboard

Toggle word wrap

Once again, check the amount of physical disk space used (data blocks used). This number should be similar to the result of step 6 above, with only a slight increase due to file system journaling and metadata:

df -h /mnt/VDOVolume

Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vdo0      1.5T   45G  1.3T   4% /mnt/VDOVolume

$ df -h /mnt/VDOVolume

Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vdo0      1.5T   45G  1.3T   4% /mnt/VDOVolume

Copy to Clipboard

Toggle word wrap

vdostats --verbose | grep "blocks used"

data blocks used                : 1050836 (increased by 3M)
overhead blocks used            : 538846
logical blocks used             : 17594127 (increased by 41G)

# vdostats --verbose | grep "blocks used"

data blocks used                : 1050836 (increased by 3M)
overhead blocks used            : 538846
logical blocks used             : 17594127 (increased by 41G)

Copy to Clipboard

Toggle word wrap

Subtract this new value of the space used by the file system from the value found before writing the test data. This is the amount of space consumed by this test from the file system's perspective.
Observe the space savings in your recorded statistics:
Note:In the following table, values have been converted to MB/GB. vdostats "blocks" are 4,096 B.
Expand
Statistic Bare File System After Seed After 10 Copies
File System Used Size 198 MB 4.2 GB 45 GB
VDO Data Used 4 MB 4.1 GB 4.1 GB
VDO Logical Used 23.6 GB* 27.8 GB 68.7 GB
* File system overhead for 1.6 TB formatted drive

Statistic	Bare File System	After Seed	After 10 Copies
File System Used Size	198 MB	4.2 GB	45 GB
VDO Data Used	4 MB	4.1 GB	4.1 GB
VDO Logical Used	23.6 GB*	27.8 GB	68.7 GB

5. Measure Compression
Copiar enlace

Create a VDO volume of at least 10 GB of physical and logical size. Add options to disable deduplication and enable compression:

vdo create --name=vdo0 --device=/dev/sdb \
             --vdoLogicalSize=10G --verbose \
             --deduplication=disabled --compression=enabled

# vdo create --name=vdo0 --device=/dev/sdb \
             --vdoLogicalSize=10G --verbose \
             --deduplication=disabled --compression=enabled

Copy to Clipboard

Toggle word wrap

Inspect VDO statistics before transfer; make note of data blocks used and logical blocks used (both should be zero):
```
vdostats --verbose | grep "blocks used"
```
```
# vdostats --verbose | grep "blocks used"
```
Copy to Clipboard Toggle word wrap
Format the new device with an XFS or ext4 file system.
- For XFS:
  # mkfs.xfs -K /dev/mapper/vdo0
  Copy to Clipboard Toggle word wrap
- For ext4:
  # mkfs.ext4 -E nodiscard /dev/mapper/vdo0
  Copy to Clipboard Toggle word wrap

Mount the formatted device:

mkdir /mnt/VDOVolume
mount /dev/mapper/vdo0 /mnt/VDOVolume && \
  chmod a+rwx /mnt/VDOVolume

# mkdir /mnt/VDOVolume
# mount /dev/mapper/vdo0 /mnt/VDOVolume && \
  chmod a+rwx /mnt/VDOVolume

Copy to Clipboard

Toggle word wrap

Synchronize the VDO volume to complete any unfinished compression:
```
sync && dmsetup message vdo0 0 sync-dedupe
```
```
# sync && dmsetup message vdo0 0 sync-dedupe
```
Copy to Clipboard Toggle word wrap
Inspect VDO statistics again. Logical blocks used — data blocks used is the number of 4 KB blocks saved by compression for the file system alone. VDO optimizes file system overhead as well as actual user data:
```
vdostats --verbose | grep "blocks used"
```
```
# vdostats --verbose | grep "blocks used"
```
Copy to Clipboard Toggle word wrap

Copy the contents of /lib to the VDO volume. Record the total size:

cp -vR /lib /mnt/VDOVolume

...
sent 152508960 bytes  received 60448 bytes  61027763.20 bytes/sec
total size is 152293104  speedup is 1.00

# cp -vR /lib /mnt/VDOVolume

...
sent 152508960 bytes  received 60448 bytes  61027763.20 bytes/sec
total size is 152293104  speedup is 1.00

Copy to Clipboard

Toggle word wrap

Synchronize Linux caches and the VDO volume:
```
sync && dmsetup message vdo0 0 sync-dedupe
```
```
# sync && dmsetup message vdo0 0 sync-dedupe
```
Copy to Clipboard Toggle word wrap
Inspect VDO statistics once again. Observe the logical and data blocks used:
```
vdostats --verbose | grep "blocks used"
```
```
# vdostats --verbose | grep "blocks used"
```
Copy to Clipboard Toggle word wrap
- Logical blocks used - data blocks used represents the amount of space used (in units of 4 KB blocks) for the copy of your /lib files.
- The total size (from the table in the section called “4. Measure Deduplication”) - (logical blocks used-data blocks used * 4096) = bytes saved by compression.

Remove the VDO volume:

umount /mnt/VDOVolume && vdo remove --name=vdo0

# umount /mnt/VDOVolume && vdo remove --name=vdo0

Copy to Clipboard

Toggle word wrap

6. Test VDO Compression Efficiency
Copiar enlace

Create and mount a VDO volume following Section 31.3.1, “Configuring a VDO Test Volume”.
Repeat the experiments in the section called “4. Measure Deduplication” and the section called “5. Measure Compression” without removing the volume. Observe changes to space savings in vdostats.
Experiment with your own datasets.

7. Understanding TRIM and DISCARD
Copiar enlace

Thin provisioning allows a logical or virtual storage space to be larger than the underlying physical storage. Applications such as file systems benefit from running on the larger virtual layer of storage, and data-efficiency techniques such as data deduplication reduce the number of physical data blocks needed to store all of the data. To benefit from these storage savings, the physical storage layer needs to know when application data has been deleted.

Traditional file systems did not have to inform the underlying storage when data was deleted. File systems that work with thin provisioned storage send TRIM or DISCARD commands to inform the storage system when a logical block is no longer required. These commands can be sent whenever a block is deleted using the discard mount option, or these commands can be sent in a controlled manner by running utilities such as fstrim that tell the file system to detect which logical blocks are unused and send the information to the storage system in the form of a TRIM or DISCARD command.

Important

For more information on how thin provisioning works, see Thinly-Provisioned Logical Volumes (Thin Volumes) in the Red Hat Enterprise Linux 7 Logical Volume Manager Administration Guide.

To see how this works:

Create and mount a new VDO logical volume following Section 31.3.1, “Configuring a VDO Test Volume”.
Trim the file system to remove any unneeded blocks (this may take a long time):
```
fstrim /mnt/VDOVolume
```
```
# fstrim /mnt/VDOVolume
```
Copy to Clipboard Toggle word wrap
Record the initial state in following table below by entering:
```
df -m /mnt/VDOVolume
```
```
$ df -m /mnt/VDOVolume
```
Copy to Clipboard Toggle word wrap
to see how much capacity is used in the file system, and run vdostats to see how many physical and logical data blocks are being used.
Create a 1 GB file with non-duplicate data in the file system running on top of VDO:
```
dd if=/dev/urandom of=/mnt/VDOVolume/file bs=1M count=1K
```
```
$ dd if=/dev/urandom of=/mnt/VDOVolume/file bs=1M count=1K
```
Copy to Clipboard Toggle word wrap
and then collect the same data. The file system should have used an additional 1 GB, and the data blocks used and logical blocks used have increased similarly.
Run fstrim /mnt/VDOVolume and confirm that this has no impact after creating a new file.
Delete the 1 GB file:
```
rm /mnt/VDOVolume/file
```
```
$ rm /mnt/VDOVolume/file
```
Copy to Clipboard Toggle word wrap
Check and record the parameters. The file system is aware that a file has been deleted, but there has been no change to the number of physical or logical blocks because the file deletion has not been communicated to the underlying storage.
Run fstrim /mnt/VDOVolume and record the same parameters. fstrim looks for free blocks in the file system and sends a TRIM command to the VDO volume for unused addresses, which releases the associated logical blocks, and VDO processes the TRIM to release the underlying physical blocks.
Expand
Step File Space Used (MB) Data Blocks Used Logical Blocks Used
Initial
Add 1 GB File
Run fstrim
Delete 1 GB File
Run fstrim

From this exercise, the TRIM process is needed so the underlying storage can have an accurate knowledge of capacity utilization. fstrim is a command line tool that analyzes many blocks at once for greater efficiency. An alternative method is to use the file system discard option when mounting. The discard option will update the underlying storage after each file system block is deleted, which can slow throughput but provides for great utilization awareness. It is also important to understand that the need to TRIM or DISCARD unused blocks is not unique to VDO; any thin-provisioned storage system has the same challenge

Volver arriba

Este contenido no está disponible en el idioma seleccionado.

31.3. Data Efficiency Testing Procedures

Test Environment

Workloads

31.3.1. Configuring a VDO Test Volume
Copiar enlace

1. Create a VDO Volume with a Logical Size of 1 TB on a 512 GB Physical Volume
Copiar enlace

31.3.2. Testing VDO Efficiency
Copiar enlace

2. Test Reading and Writing to the VDO Volume
Copiar enlace

3. Remove the VDO Volume
Copiar enlace

4. Measure Deduplication
Copiar enlace

5. Measure Compression
Copiar enlace

6. Test VDO Compression Efficiency
Copiar enlace

7. Understanding TRIM and DISCARD
Copiar enlace

Aprender

Pruebe, compre y venda

Comunidades

Acerca de la documentación de Red Hat

Hacer que el código abierto sea más inclusivo

Acerca de Red Hat

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

Step	File Space Used (MB)	Data Blocks Used	Logical Blocks Used
Initial
Add 1 GB File
Run `fstrim`
Delete 1 GB File
Run `fstrim`

Este contenido no está disponible en el idioma seleccionado.

31.3. Data Efficiency Testing Procedures

Test Environment

Workloads

31.3.1. Configuring a VDO Test VolumeCopiar enlaceEnlace copiado en el portapapeles!

1. Create a VDO Volume with a Logical Size of 1 TB on a 512 GB Physical VolumeCopiar enlaceEnlace copiado en el portapapeles!

31.3.2. Testing VDO EfficiencyCopiar enlaceEnlace copiado en el portapapeles!

2. Test Reading and Writing to the VDO VolumeCopiar enlaceEnlace copiado en el portapapeles!

3. Remove the VDO VolumeCopiar enlaceEnlace copiado en el portapapeles!

4. Measure DeduplicationCopiar enlaceEnlace copiado en el portapapeles!

5. Measure CompressionCopiar enlaceEnlace copiado en el portapapeles!

6. Test VDO Compression EfficiencyCopiar enlaceEnlace copiado en el portapapeles!

7. Understanding TRIM and DISCARDCopiar enlaceEnlace copiado en el portapapeles!

Aprender

Pruebe, compre y venda

Comunidades

Acerca de la documentación de Red Hat

Hacer que el código abierto sea más inclusivo

Acerca de Red Hat

Theme

Red Hat legal and privacy links

Red Hat legal and privacy links

31.3.1. Configuring a VDO Test Volume
Copiar enlace

1. Create a VDO Volume with a Logical Size of 1 TB on a 512 GB Physical Volume
Copiar enlace

31.3.2. Testing VDO Efficiency
Copiar enlace

2. Test Reading and Writing to the VDO Volume
Copiar enlace

3. Remove the VDO Volume
Copiar enlace

4. Measure Deduplication
Copiar enlace

5. Measure Compression
Copiar enlace

6. Test VDO Compression Efficiency
Copiar enlace

7. Understanding TRIM and DISCARD
Copiar enlace