Describe the data set. Find out the discrepancies un the data. If there are similar entries try to fix it.