On systems with many GRID storages, replicas of the same datasets might exist. A job, requiring some files of this dataset, needs to decide where to retrieve them from, when multiple options exist. This also has an implication on caching the file, since the data could be cached from multiple sources. The logic for the cache might be different from the job read action.