Use metadata mapping to define a set of file properties whose values you want to transfer between source and destination. It is analogous to metadata mapping in DataHub v3. You provide a common set of metadata properties between source and destination. When a file is transferred from source to destination (or vice versa), its property values are also transferred based on the defined map.
Metadata Mapping Options
Metadata Map
You can define a transfer job's metadata mapping in the job's transfer options JSON object.
Property | Description
---|---
schemas | A list of metadata schemas.
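As a rough sketch of where the metadata map sits, the fragment below nests a minimal map inside a job's transfer options. The wrapper keys `transfer` and `metadata_map` are assumptions not stated on this page, so confirm them against your own job definition. The `schemas`, `mappings`, `source`, `destination`, `property`, and `name` keys come from the tables on this page.

```json
{
  "transfer": {
    "metadata_map": {
      "schemas": [
        {
          "mappings": [
            {
              "source": { "property": { "name": "Classification" } },
              "destination": { "property": { "name": "Classification" } }
            }
          ]
        }
      ]
    }
  }
}
```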
Schema Mapping ("schemas" in Metadata Map)
A schema mapping defines a metadata mapping between source and destination. You can provide a source and/or destination schema id that the transfer engine can retrieve from the platform provider to get additional schema information about the properties in the mapping.
Whether metadata is imported from one schema or from multiple schemas per file depends on the platform. If the platform supports multiple schemas, metadata from multiple schemas is imported. For Office 365, which supports only one schema per file, the metadata importer first looks for custom metadata fields and then looks for metadata in the order in which the schemas are listed in the schemas block.
Property | Description
---|---
source | The source schema definition to use in this mapping. When specified, it is used to load type information for all properties in the schema.
destination | The destination schema definition to use in this mapping. When specified, it is used to load type information for all properties in the schema.
default | Defines the schema as the fallback schema to use when mapping properties from source to destination if a mapper that matches the specified schema is not found.
mappings | A list of property associations between source and destination. You can provide the type of property, though it is not required.
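A single schema mapping might look like the sketch below. The shape of the `source` and `destination` schema definitions (an object carrying an `id`) and the boolean value for `default` are assumptions based on the descriptions above, and both id values are placeholders rather than real schema ids.

```json
{
  "source": { "id": "placeholder-source-schema-id" },
  "destination": { "id": "placeholder-destination-schema-id" },
  "default": true,
  "mappings": [
    {
      "source": { "property": { "name": "Classification" } },
      "destination": { "property": { "name": "Classification" } }
    }
  ]
}
```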
Property Definition Mapping ("mappings" in Schema)
A property definition mapping defines the map between a source and destination property.
Property | Description
---|---
source | The source property.
destination | The destination property the source maps to.
choices | A map of choices. This is used when the property is a choice of values (e.g. yes, no) and you need to map these values between the source and destination because they differ.
Property Choice ("choices" in Property Definition Mapping)
A property choice defines the map between a source and destination choice value. For instance, if the property has valid values of true/false in the source and yes/no in the destination, you can use a property choice to map them respectively.
Property | Description
---|---
source | The name and value of the source.
destination | The name and value of the destination.
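The true/false to yes/no case described above could be written roughly as follows. This is only a sketch: the property names `is_confidential` and `Confidential` are hypothetical, and writing `choices` as a list of source/destination pairs with `name` and `value` keys is an assumption drawn from the table descriptions, not a confirmed format.

```json
{
  "source": { "property": { "name": "is_confidential" } },
  "destination": { "property": { "name": "Confidential" } },
  "choices": [
    {
      "source": { "name": "true", "value": "true" },
      "destination": { "name": "yes", "value": "yes" }
    },
    {
      "source": { "name": "false", "value": "false" },
      "destination": { "name": "no", "value": "no" }
    }
  ]
}
```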
Property Value Map ("source" and "destination" in Property Definition Mapping)
A property value map defines the actual property to map to (in either the source or destination) and what actions to take when a value is either missing or invalid during transfer.
Property | Description | Valid Values
---|---|---
property | The property to map to. |
when_missing | Action to take when the property value is missing during transfer. | skip, default, calculate, fail
when_invalid | Action to take when the property value is invalid or cannot be coerced to the destination type during transfer. | fail, warn, skip, default
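For illustration, a property value map that skips missing values and only warns on invalid ones might look like this. The property name is taken from the Box example later on this page, and the two action values are picked from the valid values listed above.

```json
{
  "property": { "name": "Agreement Type" },
  "when_missing": "skip",
  "when_invalid": "warn"
}
```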
Property Definition ("property" in Property Value Map)
Property | Description | Valid Values
---|---|---
name | The name of the property. This is the only required field. You can alternatively use query_name, id or caption. |
type | An optional type, typically provided by the schema definition from the platform if the schema id is specified in the schema mapping. | unknown, boolean, id, integer, datetime, decimal, html, string, uri, lookup, account
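A property definition can be as small as a name, optionally with a type from the list of valid values. In this sketch the property name is hypothetical, and the explicit `type` is only needed when the platform schema does not already supply it.

```json
{
  "name": "review_date",
  "type": "datetime"
}
```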
Example JSON
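The following sketch pulls the pieces described above into one metadata map. It is illustrative only: the schema ids are placeholders, the property names are hypothetical, and the `id` shape of the schema definitions and the list form of `choices` are assumptions rather than confirmed formats.

```json
{
  "schemas": [
    {
      "source": { "id": "placeholder-source-schema-id" },
      "destination": { "id": "placeholder-destination-schema-id" },
      "default": true,
      "mappings": [
        {
          "source": {
            "property": { "name": "is_confidential", "type": "boolean" },
            "when_missing": "skip"
          },
          "destination": {
            "property": { "name": "Confidential", "type": "string" },
            "when_invalid": "warn"
          },
          "choices": [
            {
              "source": { "name": "true", "value": "true" },
              "destination": { "name": "yes", "value": "yes" }
            },
            {
              "source": { "name": "false", "value": "false" },
              "destination": { "name": "no", "value": "no" }
            }
          ]
        }
      ]
    }
  ]
}
```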
Example JSON: Box - O365
This example includes custom metadata and template metadata.
To import metadata into O365, the columns must first be defined in the library (i.e., cog → library settings → create column).
Source (Box) Metadata: In this table, the headings indicate the template name followed by the metadata field name.
File | custom_attribute1 | Legal: Affiliates | Legal: Classification | Legal: Agreement Type | Presales - RFI Response Archive: Initiative
---|---|---|---|---|---
file1 | custom attribute1 value1 | | | |
file2 | custom attribute1 value2 | Talbot Underwriting Ltd | Restricted | Europe Distributor |
file3 | custom attribute1 value3 | | | | Data Warehousing
file4 | custom attribute1 value4 | Ross Products; Abbot Diabetics Care | Restricted | Partner | Data Replication
file5 | | AMES | Restricted | Cloud |
file6 | | Agility Logistics Ltd | Restricted | License | Data Quality
file7 | | | | | Master Data Management
Destination (Office 365) Metadata: The schemas are used for import in this order: custom, Presales, Legal. Because Office 365 supports only one schema per file, each file receives metadata from the first of these that has values.
File | custom_attribute2 | Affiliates | Classification | Agreement Type | Initiative
---|---|---|---|---|---
file1 | custom attribute1 value1 | | | |
file2 | custom attribute1 value2 | | | |
file3 | custom attribute1 value3 | | | |
file4 | custom attribute1 value4 | | | |
file5 | | AMES | Restricted | Cloud |
file6 | | | | | Data Quality
file7 | | | | | Master Data Management
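A metadata map that could produce the result above might be sketched as follows, with the Presales schema listed before the Legal schema so that it takes precedence after custom metadata. Treat this as an assumption-heavy illustration: the template ids are placeholders, the `id` shape of the schema definition is assumed, and using a schema entry without a source id to carry the custom metadata mapping is a guess, not a documented convention. The property names come from the tables above.

```json
{
  "schemas": [
    {
      "mappings": [
        {
          "source": { "property": { "name": "custom_attribute1" } },
          "destination": { "property": { "name": "custom_attribute2" } }
        }
      ]
    },
    {
      "source": { "id": "placeholder-presales-template-id" },
      "mappings": [
        {
          "source": { "property": { "name": "Initiative" } },
          "destination": { "property": { "name": "Initiative" } }
        }
      ]
    },
    {
      "source": { "id": "placeholder-legal-template-id" },
      "mappings": [
        {
          "source": { "property": { "name": "Affiliates" } },
          "destination": { "property": { "name": "Affiliates" } }
        },
        {
          "source": { "property": { "name": "Classification" } },
          "destination": { "property": { "name": "Classification" } }
        },
        {
          "source": { "property": { "name": "Agreement Type" } },
          "destination": { "property": { "name": "Agreement Type" } }
        }
      ]
    }
  ]
}
```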
If multiple versions of a file exist and are uploaded during the same job run, only the most recent metadata is preserved, and it is applied to all versions of the file uploaded during that job run.