Indexing in Hadoop has two different levels.
1. Index based on File URI: In this case data is indexed based on different files. When we search for data, index will return the files that contain the data.
2. Index based on InputSplit: In this case, data is indexed based on
locations where input split is located.