Sunday, April 3, 2016

What Is Data Mapping?

A Data Mapping Specification is a special type of data dictionary that shows how data from one information system maps to data from another information system. Creating a data mapping specification helps you and your project team avoid numerous potential issues, the kind that tend to surface late in development or during user acceptance testing and throw off project schedules, not to mention irritating your stakeholders.

Data Mapping Specifications are particularly valuable in the following types of projects:

1.) Data Migration – When source data is migrated to a new target data repository.
2.) Data Integration – When source data is sent to a target data repository on a regular basis and the two data sources do not share a common data model. The integration can happen hourly, daily, weekly, monthly, or even in real-time as is typically required for a system integration.

These might sound similar, and they are. The primary difference between the two is that after a data migration project is complete, the original source data is no longer used or maintained while after a data integration project is complete, both data sources are maintained.
 

The Key Elements of a Data Mapping Specification

Essentially a data mapping specification will analyze, on a field-by-field basis, how to move data from one system to another.  For example, if I were to orchestrate a data feed from the Bridging the Gap article repository to a search engine, I would want to map key attributes of the article, such as the title, category, and content to the attributes specified by the search engine. This analysis exercise would ensure that each piece of information ended up in the most appropriate place in the target data repository.

To achieve this goal, a Data Mapping Specification contains the following elements:

    List of attributes for the original source of data (often with additional information from the data dictionary)
    A corresponding (or “mapped”) list of attributes for the target data repository (again, with additional information from the data dictionary)
    Translation rules defining any data manipulation that needs to happen as information moves between the two sources, such as setting default values, combining fields, or mapping values

Data Mapping is About Resolving Potential Issues

Creating a Data Mapping Specification requires discovering and resolving potential issues prior to the data mapping being implemented. In data migrations and integrations, any number of differences between how data is stored can cause data to be lost or mis-represented.

For example, it may be that your source data has a text field and your target date repository uses an enumerated list. Without analyzing the data and providing logic for mapping the text values to the allowable list values (or initiating appropriate data clean-up efforts), you are likely to experience unexpected errors during the system migration.


Source: http://www.bridging-the-gap.com/what-is-data-mapping/