No matter how many controls we put into place to prevent duplicate contact creation, inevitably we always end up with a few, especially when website traffic volume has been high. As such, best practices for managing contacts include a periodic review and merging of duplicate contacts (especially after a big event or fundraising campaign). CiviCRM is equipped with several features that help prevent duplicate contact creation.
Does your data contain special characters such as accented letters or names with apostrophes? Have you exported that data in CSV format and found that those special characters have been made even more special (funky, actually)?
If so, there is a setting called "Language Encoding" located in CiviCRM under Administer > Localization > Languages, Currency, Locations that can be configured as "UTF-8" such that any special characters being exported to CSV will be preserved when you open the file.