Reduce traverse time of large directory trees on Linux

Byetzion 14/12/202127/01/2023

Every Linux admin is familiar with the long time running through a large directory tree (with hundred of thousands of files and more) can take. Most are aware that if you re-run the same run-through, it will be shorter.

This is caused by a short-valid filesystem cache, where the memory is allocated to other tasks, or the metadata required cache exceeds the available for this task.

If the system is focused on files, meaning that its prime task is holding files (like NFS server, for example) and the memory is largely available, a certain tunable can reduce recurring directory dives (like the ‘find’ or ‘rsync’ commands, which run huge amounts of attribute queries):

sysctl vm.vfs_cache_pressure=10

The default value is 100. Lower values will cause the system to prefer keeping this cache. A quote from kernel’s memory tunables page:

vfs_cache_pressure
——————————–
This percentage value controls the tendency of the kernel to reclaim the memory which is used for caching of directory and inode objects.

At the default value of vfs_cache_pressure=100 the kernel will attempt to reclaim dentries and inodes at a “fair” rate with respect to pagecache and swapcache reclaim. Decreasing vfs_cache_pressure causes the kernel to prefer to retain dentry and inode caches. When vfs_cache_pressure=0, the kernel will never reclaim dentries and inodes due to memory pressure and this can easily lead to out-of-memory conditions. Increasing vfs_cache_pressure beyond 100 causes the kernel to prefer to reclaim dentries and inodes.

Increasing vfs_cache_pressure significantly beyond 100 may have negative performance impact. Reclaim code needs to take various locks to find freeable directory and inode objects. With vfs_cache_pressure=1000, it will look for ten times more freeable objects than there are.

Laptop | Linux

i810 dual-pipe issues with power management

Byetzion 09/05/2008

I have had a problem with my IBM X41 – ever since I have started using Ubuntu 7.10 (after a nice upgrade from 7.04), whenever the lid was closed, and reopened – the display would have flickered for a short while (while the lid is up) and then blank completely. My (ugly) workaround was to…

Linux

IPSec VPN for mobile devices on Linux

Byetzion 08/12/2012

I have had recently the pleasure and challenge of setting up VPN server for mobile devices on top of Linux. the common method to do so would be by using IPSec + L2TP, as these are to more common methods mobile devices allow, and it should work quite fine with other types of clients (although…

Linux | Virtualization

After a very long absence, Changing Linux HVM to PV on Xen

Byetzion 08/05/2010

I have had a stressed time, and had no time to actually write down anything here. This is pity, since I have been doing so many things worth sharing. I will start with a small one now – how to convert a physical machine into Xen-based VM. I assume you know the drill of how…

Clusters | Disk Storage | RedHat Cluster | Virtualization

RedHat cluster on RHEL6 and KVM-based VMs

Byetzion 01/08/2012

The concept of running a virtual machine, KVM-based, in this case, under RHCS is acceptable and reasonable. The interesting part is that the <vm/> directive replaces the <service/> directive and acts as a high-level directive for VMs. This allows for things which cannot be performed with regular ‘service’, such as live migration. There are probably…

Linux

Hello Edgy, goodbye Feisty!

Byetzion 03/07/2007

I had a rough experience upgrading Edgy to Feisy. Hardware listing, for generations to come: Motherboard Abit IB9, including on-board IDE controller IT8211 with firmware 1.3.1.67 and no BIOS upgrade in the near future. Nvidia 7700 with 256MB RAM on PCIe Problems: 1. With IDE controller enabled in BIOS system hangs during startup. This is…

Disk Storage | Scripting/Programming

Cacti NetApp Ontap API data query

Byetzion 23/06/201311/07/2015

I have been using the excellent template and scripts from this forum post, however, when the NetApp device is loaded with LUNs and volumes, the script will cause the Cacti to timeout, and during that time, consume CPU. The original cause of this problem was a workaround to some NetApp Perl API bug the original…

Related posts:

Similar Posts

Leave a Reply Cancel reply