The extra read-only data needed by a mapreduce job to process the main data set is called as side data
Hi..Side data.
The extra read-only data needed by a mapreduce job to process the main data set is called as side data.There are two ways to make side data available to all the map or reduce tasks.Job Configuration Distributed cache