File system fragmentation: Difference between revisions

Content deleted Content added
Related file fragmentation: Why techniques fail to effective
Line 22:
====Related file fragmentation====
Related file fragmentation refers to the lack of [[locality of reference]] between related files. Unlike the previous two types of fragmentation, related file fragmentation is a much more vague concept, as it heavily depends on the access pattern of specific applications. This also makes objectively measuring or estimating it very difficult. However, arguably, it is the most critical type of fragmentation, as studies have found that the most frequently accessed files tend to be small compared to available disk throughput per second.<ref name=filesys-contents>{{cite journal |author=John R. Douceur, William J. Bolosky |date=1999-06 |title=A Large-Scale Study of File-System Contents |publisher=[[Microsoft Research]] |pages=pages&nbsp;59&ndash;70 |journal=[[ACM SIGMETRICS]] Performance Evaluation Review |volume=volume&nbsp;27 |issue=issue&nbsp;1 |url=http://research.microsoft.com/~bolosky/papers/SIGMETRICS99/filesys.pdf |format=[[PDF]] |issn=0163-5999 |accessdate=2006-12-14 }}</ref>
 
To avoid related file fragmentation and improve locality of reference, assumptions about the operation of applications have to be made. A very frequent assumption made is that it is worthwhile to keep smaller files within a single directory together, and lay them out in the natural file system order. While it is often a reasonable assumption, it does not always hold. For example, an application might read several different files, perhaps in different directories, in the exact same order they were written. Thus, a file system that does not reorder writes at all, might work faster for the given application.
 
==Techniques for mitigating fragmentation==