RIF-CS to Schema.org crosswalk
Schema.org is a structured data markup vocabulary supported by major search engines. On-page markup helps search engines understand the information on web pages and provide richer search results. Research Data Australia (RDA) makes Schema.org metadata available for collection and service records via a RIF-CS to Schema.org crosswalk. Google uses the Schema.org markup on Research Data Australia collection records to index them and display them in Google Dataset Search results.
Table 1. RIF-CS mapping from Collection@type="dataset" or "collection" to Schema.org "Dataset"
Schema.org Property (v3.3) | Expected Type | Description | Property From | Maps to RIF-CS (v1.6) | Mapping Notes |
---|---|---|---|---|---|
version | Number or Text | The version of the CreativeWork embodied by a specified resource. | CreativeWork | registryObject:collection:citationInfo:citationMetadata:version | |
accountablePerson | Person | Specifies the Person that is legally accountable for the CreativeWork. | CreativeWork | relatedInfo|relatedObject:Party/name where relation[@type='isOwnedBy'] OR [@type='isManagedBy'] | |
alternateName | Text | An alias for the item. | Thing | collection/name[@type='alternative'] OR collection/name[@type='abbreviated'] | |
alternativeHeadline | Text | A secondary title of the CreativeWork. | CreativeWork | collection/name[@type='alternative'] OR collection/name[@type='abbreviated'] | |
citation | CreativeWork or Text | A citation or reference to another creative work, such as another publication, web page, scholarly article, etc. | CreativeWork | collection/relatedInfo[@type='publication'] OR relatedObject:collection where related collection [@type='publication'] | Map to CreativeWork - relatedObject|relatedInfo display title >> name - relatedObject|relatedInfo identifiers >> identifier |
contributor | Organization or Person | A secondary contributor to the CreativeWork or Event. | CreativeWork | No Mapping | |
creator | Organization or Person | The creator/author of this CreativeWork. This is the same as the Author property for CreativeWork. | CreativeWork | collection/citationInfo/citationMetadata/contributor OR relatedObject|relatedInfo party/name where relation=IsPrincipalInvestigatorOf OR relatedObject|relatedInfo party/name where relation=author OR relatedObject|relatedInfo party/name where relation=coInvestigator OR relatedObject|relatedInfo party/name where relation=hasCollector | - Where party[@type='group'] map to Organization type - Where party[@type=<NOT group>] map to Person type - Populate the name and identifier (where possible) properties of Organization and Person. - Where record exists in RDA also populate the URL with the RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" |
dateCreated | Date or DateTime | The date on which the CreativeWork was created or the item was added to a DataFeed. | CreativeWork | collection/citationInfo/citationMetadata/date[@type= 'created'] OR collection/dates[@type='created'] | |
datePublished | Date | Date of first broadcast/publication. | CreativeWork | collection/citationInfo/citationMetadata/date[@type='publicationDate'] OR collection/citationInfo/citationMetadata/date[@type='issued'] OR collection/dates[@type='dc.issued'] OR collection/dates[@type='dc.available'] OR Collection[@dateAccessioned] OR date the record was ingested into RDA | |
description | Text | A description of the item. | Thing | collection/description[@type='brief'] OR collection/description[@type='full'] | |
distribution | DataDownload | A downloadable form of this dataset, at a specific location, in a specific format. | Dataset | location/address/electronic[@type='url' and @target='directDownload'] | - Map to DataDownload type. - Map electronic/byteSize to contentSize in DataDownload type. - Map electronic/value to contentUrl in DataDownload type. - Map electronic/mediaType to encodingFormat in DataDownload type. - Map electronic/notes to description in DataDownload type. |
encodingFormat | Text or URL | Media type, typically MIME format (see IANA site) of the content e.g. application/zip of a SoftwareApplication binary. In cases where a CreativeWork has several media type representations, 'encoding' can be used to indicate each MediaObject alongside particular fileFormat information. Unregistered or niche file formats can be indicated instead via the most appropriate URL, e.g. defining Web page or a Wikipedia entry. | CreativeWork | collection/address/electronic[@target='directDownload']/mediaType | |
funder | Organization or Person | A person or organization that supports (sponsors) something through some kind of financial contribution. | CreativeWork | relatedObject|relatedInfo party/name where relation=isFundedBy | - Grants network relationship. Collection relates to activity. Activity relates to party. - Where party[@type='group'] map to Organization type - Where party[@type=<NOT group>] map to Person type - Populate the name and identifier (where possible) properties of Organization and Person. - Where record exists in RDA also populate the URL with the RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" |
hasPart | CreativeWork | Indicates a CreativeWork that is (in some sense) a part of this CreativeWork. Inverse property: isPartOf. | CreativeWork | relatedObject|relatedInfo collection where relation[@type='hasPart'] | Map basic collection info to Dataset(CreativeWork) |
identifier | PropertyValue or Text or URL | The identifier property represents any kind of identifier for any kind of Thing, such as ISBNs, GTIN codes, UUIDs etc. Schema.org provides dedicated properties for representing many of these, either as textual strings or as URL (URI) links. See background notes for more details. | Thing | collection/citationInfo/citationMetadata/identifier AND/OR collection/identifier | - Map to URL type where possible. Based on RDA resolvable identifier rules. - Where resolvable identifier link is not possible, map as PropertyValue type. Identifier type + identifier value. E.g. "identifier": {"@type": "PropertyValue", "propertyID": "OCoLC", "value": "889647468"}, |
includedInDataCatalog | DataCatalog | A data catalog which contains this dataset. Supersedes catalog, includedDataCatalog. Inverse property: dataset. | Dataset | No Mapping | |
inLanguage | Language or Text | The language of the content or performance or used in an action. Please use one of the language codes from the IETF BCP 47 standard. See also availableLanguage. Supersedes language. | CreativeWork | (default to English) "en" | |
isAccessibleForFree | Boolean | A flag to signal that the publication is accessible for free. Supersedes free. | CreativeWork | No Mapping | |
isBasedOn | CreativeWork or Product or URL | A resource that was used in the creation of this resource. This term can be repeated for multiple sources. For example, http://example.com/great-multiplication-intro.html. Supersedes isBasedOnUrl. | CreativeWork | relatedObject|relatedInfo collection where relation[@type='isDerivedFrom'] | Map basic collection info to Dataset(CreativeWork) |
isPartOf | CreativeWork | Indicates a CreativeWork that this CreativeWork is (in some sense) part of. Inverse property: hasPart. | CreativeWork | relatedObject|relatedInfo collection where relation[@type='isPartOf'] | Map basic collection info to Dataset(CreativeWork) |
keywords | Text | Keywords or tags used to describe this content. Multiple entries in a keywords list are typically delimited by commas. | CreativeWork | collection/subject | |
license | CreativeWork or URL | A license document that applies to this content, typically indicated by URL. | CreativeWork | collection/rights/licence[@rightsURI] AND/OR collection/rights/licence[@type] AND collection/rights/licence | |
mentions | Thing | Indicates that the CreativeWork contains a reference to, but is not necessarily about a concept. | CreativeWork | No Mapping | |
name | Text | The name of the item. | Thing | registryObject:collection:name (Title as displayed in RDA) | |
publisher | Organization or Person | The publisher of the creative work. | CreativeWork | registryObject:collection:citationInfo:citationMetadata:publisher OR registryObject@Group | |
sourceOrganization | Organization | The Organization on whose behalf the creator was working. | CreativeWork | registryObject@Group | |
spatialCoverage | Place or GeoCoordinates or GeoShape | The spatialCoverage of a CreativeWork indicates the place(s) which are the focus of the content. It is a subproperty of contentLocation intended primarily for more technical and detailed materials. For example with a Dataset, it indicates areas that the dataset describes: a dataset of New York weather would have spatialCoverage which was the place: the state of New York. Supersedes spatial. | CreativeWork | collection/coverage/spatial | Where spatial type is one of "dcmiPoint", "iso19139dcmiBox", "gmlKmlPolyCoords" or "kmlPolyCoords", map to GeoCoordinates or GeoShape, e.g. {"@type": "Place", "geo": {"@type": "GeoCoordinates", "latitude": "-22.917923", "longitude": "143.085938" }}, { "@type": "Place", "geo": {"@type": "GeoShape", "box": "-35.173808 110.390625 -14.093957 150.644531" }}, |
temporalCoverage | DateTime or Text or URL | The temporalCoverage of a CreativeWork indicates the period that the content applies to, i.e. that it describes, either as a DateTime or as a textual string indicating a time period in ISO 8601 time interval format. In the case of a Dataset it will typically indicate the relevant time period in a precise notation (e.g. for a 2011 census dataset, the year 2011 would be written "2011/2012"). Other forms of content e.g. ScholarlyArticle, Book, TVSeries or TVEpisode may indicate their temporalCoverage in broader terms - textually or via well-known URL. Written works such as books may sometimes have precise temporal coverage too, e.g. a work set in 1939 - 1945 can be indicated in ISO 8601 interval format via "1939/1945". Supersedes datasetTimeInterval, temporal. | CreativeWork | collection/coverage/temporal | Where possible, map temporal coverage to start and end dates in DateTime. Where date @type="dateFrom" +/- date @type="dateTo", then separate with a "/", e.g. "1950-01-01/2013-12-18" (date range), "2013-12-19/.." or "../2019" (open-ended) |
url | URL | URL of the item. | Thing | RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" | |
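
The temporalCoverage and spatialCoverage mapping notes in Table 1 can be sketched in Python as below. This is a minimal illustration only, not the RDA implementation: the function names are hypothetical, the spatial values are assumed to arrive as DCMI-style `key=value; key=value` strings (for example `east=...; north=...` for a point and `northlimit=...; southlimit=...` for a box), and the polygon types are left out.

```python
from typing import Optional


def temporal_coverage(date_from: Optional[str], date_to: Optional[str]) -> Optional[str]:
    """Join RIF-CS dateFrom/dateTo values into an ISO 8601 interval string."""
    if not date_from and not date_to:
        return None
    # ".." marks the open end of an interval, as in the mapping note.
    return f"{date_from or '..'}/{date_to or '..'}"


def parse_dcmi(value: str) -> dict:
    """Parse an assumed 'key=value; key=value' string into a lowercase-keyed dict."""
    pairs = (part.split("=", 1) for part in value.split(";") if "=" in part)
    return {key.strip().lower(): val.strip() for key, val in pairs}


def spatial_coverage(spatial_type: str, value: str) -> Optional[dict]:
    """Map a point or box to a Schema.org Place; other types return None here."""
    fields = parse_dcmi(value)
    if spatial_type == "dcmiPoint":
        geo = {
            "@type": "GeoCoordinates",
            "latitude": fields["north"],
            "longitude": fields["east"],
        }
    elif spatial_type == "iso19139dcmiBox":
        # Box order follows the example in Table 1: south west north east.
        order = ("southlimit", "westlimit", "northlimit", "eastlimit")
        geo = {"@type": "GeoShape", "box": " ".join(fields[k] for k in order)}
    else:
        return None  # polygon types are not handled in this sketch
    return {"@type": "Place", "geo": geo}
```

Keeping the coordinate values as strings mirrors the quoted values in the Table 1 example; a real crosswalk might validate or convert them.
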
Table 2. RIF-CS mapping from Collection@type="software" to Schema.org "SoftwareSourceCode"
Schema.org Property (v3.3) | Expected Type | Description | Property From | Maps to RIF-CS (v1.6) | Mapping Notes |
---|---|---|---|---|---|
codeRepository | URL | Link to the repository where the un-compiled, human readable code and related code is located (SVN, github, CodePlex). | SoftwareSourceCode | collection/address/electronic[@type=url] | |
accountablePerson | Person | Specifies the Person that is legally accountable for the CreativeWork. | CreativeWork | relatedInfo|relatedObject:Party/name where relation[@type='isOwnedBy'] | |
alternativeHeadline | Text | A secondary title of the CreativeWork. | CreativeWork | collection/name[@type='alternative'] OR collection/name[@type='abbreviated'] | |
author | Organization or Person | The author of this content or rating. Please note that author is special in that HTML 5 provides a special mechanism for indicating authorship via the rel tag. That is equivalent to this and may be used interchangeably. | CreativeWork | collection/citationInfo/citationMetadata/contributor OR ( relatedObject|relatedInfo party/name where relation=IsPrincipalInvestigatorOf OR/AND relatedObject|relatedInfo party/name where relation=author OR/AND relatedObject|relatedInfo party/name where relation=coInvestigator ) OR relatedObject|relatedInfo party/name where relation=isOwnedBy OR relatedObject|relatedInfo party/name where relation=hasCollector | - Where party[@type='group'] map to Organization type - Where party[@type=<NOT group>] map to Person type - Populate the name and identifier (where possible) properties of Organization and Person. - Where record exists in RDA also populate the URL with the RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" |
citation | CreativeWork or Text | A citation or reference to another creative work, such as another publication, web page, scholarly article, etc. | CreativeWork | collection/relatedInfo[@type='publication'] OR relatedObject:collection where related collection [@type='publication'] | Map to CreativeWork - relatedObject|relatedInfo display title >> name - relatedObject|relatedInfo identifiers >> identifier |
contributor | Organization or Person | A secondary contributor to the CreativeWork or Event. | CreativeWork | No Mapping | |
creator | Organization or Person | The creator/author of this CreativeWork. This is the same as the Author property for CreativeWork. | CreativeWork | collection/citationInfo/citationMetadata/contributor OR relatedObject|relatedInfo party/name where relation=IsPrincipalInvestigatorOf OR relatedObject|relatedInfo party/name where relation=author OR relatedObject|relatedInfo party/name where relation=coInvestigator OR relatedObject|relatedInfo party/name where relation=isOwnedBy OR relatedObject|relatedInfo party/name where relation=hasCollector | - Where party[@type='group'] map to Organization type - Where party[@type=<NOT group>] map to Person type - Populate the name and identifier (where possible) properties of Organization and Person. - Where record exists in RDA also populate the URL with the RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" |
dateCreated | Date or DateTime | The date on which the CreativeWork was created or the item was added to a DataFeed. | CreativeWork | collection/citationInfo/citationMetadata/date[@type= 'created'] OR collection/dates[@type='created'] | |
dateModified | Date or DateTime | The date on which the CreativeWork was most recently modified or when the item's entry was modified within a DataFeed. | CreativeWork | Collection[@dateModified] | |
datePublished | Date | Date of first broadcast/publication. | CreativeWork | collection/citationInfo/citationMetadata/date[@type='publicationDate'] OR collection/citationInfo/citationMetadata/date[@type='issued'] OR collection/citationInfo/citationMetadata/date[@type='created'] OR collection/dates[@type='dc.issued'] OR collection/dates[@type='dc.available'] OR collection/dates[@type='dc.created'] OR Collection[@dateModified] OR Collection[@dateAccessioned] OR date the record was ingested into RDA | |
fileFormat | Text or URL | Media type, typically MIME format (see IANA site) of the content e.g. application/zip of a SoftwareApplication binary. In cases where a CreativeWork has several media type representations, 'encoding' can be used to indicate each MediaObject alongside particular fileFormat information. Unregistered or niche file formats can be indicated instead via the most appropriate URL, e.g. defining Web page or a Wikipedia entry. | CreativeWork | collection/address/electronic[@target='directDownload']/mediaType | |
funder | Organization or Person | A person or organization that supports (sponsors) something through some kind of financial contribution. | CreativeWork | relatedObject|relatedInfo party/name where relation=isFundedBy | - Grants network relationship. Collection relates to activity. Activity relates to party. - Where party[@type='group'] map to Organization type - Where party[@type=<NOT group>] map to Person type - Populate the name and identifier (where possible) properties of Organization and Person. - Where record exists in RDA also populate the URL with the RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" |
hasPart | CreativeWork | Indicates a CreativeWork that is (in some sense) a part of this CreativeWork. Inverse property: isPartOf. | CreativeWork | relatedObject|relatedInfo collection where relation[@type='hasPart'] | Map basic collection info to Software|Dataset(CreativeWork) |
inLanguage | Language or Text | The language of the content or performance or used in an action. Please use one of the language codes from the IETF BCP 47 standard. See also availableLanguage. Supersedes language. | CreativeWork | (default to English) "en" | |
isAccessibleForFree | Boolean | A flag to signal that the item, event, or place is accessible for free. Supersedes free. | CreativeWork | No Mapping | |
isBasedOn | CreativeWork or Product or URL | A resource that was used in the creation of this resource. This term can be repeated for multiple sources. For example, http://example.com/great-multiplication-intro.html. Supersedes isBasedOnUrl. | CreativeWork | relatedObject|relatedInfo collection where relation[@type='isDerivedFrom'] | Map basic collection info to Software|Dataset(CreativeWork) |
isPartOf | CreativeWork | Indicates a CreativeWork that this CreativeWork is (in some sense) part of. Inverse property: hasPart. | CreativeWork | relatedObject|relatedInfo collection where relation[@type='isPartOf'] | Map basic collection info to Software|Dataset(CreativeWork) |
keywords | Text | Keywords or tags used to describe this content. Multiple entries in a keywords list are typically delimited by commas. | CreativeWork | collection/subject | |
license | CreativeWork or URL | A license document that applies to this content, typically indicated by URL. | CreativeWork | collection/rights/licence[@rightsURI] AND/OR collection/rights/licence[@type] AND collection/rights/licence | |
mentions | Thing | Indicates that the CreativeWork contains a reference to, but is not necessarily about a concept. | CreativeWork | No Mapping | |
publisher | Organization or Person | The publisher of the creative work. | CreativeWork | registryObject:collection:citationInfo:citationMetadata:publisher OR registryObject@Group | |
sourceOrganization | Organization | The Organization on whose behalf the creator was working. | CreativeWork | registryObject@Group | |
spatialCoverage | Place | The spatialCoverage of a CreativeWork indicates the place(s) which are the focus of the content. It is a subproperty of contentLocation intended primarily for more technical and detailed materials. For example with a Dataset, it indicates areas that the dataset describes: a dataset of New York weather would have spatialCoverage which was the place: the state of New York. Supersedes spatial. | CreativeWork | collection/coverage/spatial | |
temporalCoverage | DateTime or Text or URL | The temporalCoverage of a CreativeWork indicates the period that the content applies to, i.e. that it describes, either as a DateTime or as a textual string indicating a time period in ISO 8601 time interval format. In the case of a Dataset it will typically indicate the relevant time period in a precise notation (e.g. for a 2011 census dataset, the year 2011 would be written "2011/2012"). Other forms of content e.g. ScholarlyArticle, Book, TVSeries or TVEpisode may indicate their temporalCoverage in broader terms - textually or via well-known URL. Written works such as books may sometimes have precise temporal coverage too, e.g. a work set in 1939 - 1945 can be indicated in ISO 8601 interval format via "1939/1945". Supersedes datasetTimeInterval, temporal. | CreativeWork | collection/coverage/temporal | |
version | Number or Text | The version of the CreativeWork embodied by a specified resource. | CreativeWork | registryObject:collection:citationInfo:citationMetadata:version | |
alternateName | Text | An alias for the item. | Thing | collection/name[@type='alternative'] OR collection/name[@type='abbreviated'] | |
description | Text | A description of the item. | Thing | collection/description[@type='brief'] OR collection/description[@type='full'] | |
identifier | PropertyValue or Text or URL | The identifier property represents any kind of identifier for any kind of Thing, such as ISBNs, GTIN codes, UUIDs etc. Schema.org provides dedicated properties for representing many of these, either as textual strings or as URL (URI) links. See background notes for more details. | Thing | collection/citationInfo/citationMetadata/identifier AND/OR collection/identifier | - Map to URL type where possible. Based on RDA resolvable identifier rules. - Where resolvable identifier link is not possible, map as PropertyValue type. Identifier type + identifier value. E.g. "identifier": { "@type": "PropertyValue", "propertyID": "OCoLC", "value": "889647468"}, |
name | Text | The name of the item. | Thing | registryObject:collection:name (Title as displayed in RDA) | |
url | URL | URL of the item. | Thing | RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" | |
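
The party rules in the author, creator and funder mapping notes above (group parties become Organization, all others become Person, with name, identifier and RDA URL populated where available) could look roughly like this in Python. It is a sketch under assumptions: `party_to_schema` and its parameters are hypothetical names, and URL-encoding the record key is an assumption rather than documented RDA behaviour.

```python
from typing import Optional
from urllib.parse import quote

# URL pattern quoted in the mapping notes of the tables.
RDA_VIEW_URL = "https://researchdata.edu.au/view?key="


def party_to_schema(
    name: str,
    party_type: str,
    identifier: Optional[str] = None,
    rda_key: Optional[str] = None,
) -> dict:
    """Map a related RIF-CS party to a Schema.org Organization or Person."""
    # party[@type='group'] maps to Organization; any other type maps to Person.
    schema_type = "Organization" if party_type == "group" else "Person"
    entity = {"@type": schema_type, "name": name}
    if identifier:
        entity["identifier"] = identifier
    if rda_key:
        # Only set when the party has its own record in RDA.
        # Percent-encoding the key is an assumption of this sketch.
        entity["url"] = RDA_VIEW_URL + quote(rda_key, safe="")
    return entity
```

For example, `party_to_schema("Example University", "group")` yields an Organization with only a name, while a party with a record key also receives a `url` property.
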
Table 3. RIF-CS mapping from RegistryObject/Service to Schema.org "Service"
Schema.org Property (v3.3) | Expected Type | Description | Property From | Maps to RIF-CS (v1.6) | Mapping Notes |
---|---|---|---|---|---|
provider | Organization or Person | The service provider, service operator, or service performer; the goods producer. Another party (a seller) may offer those services or goods on behalf of the provider. A provider may also serve as the seller. Supersedes carrier. | Service | relatedInfo|relatedObject:Party/name where relation[@type='isOwnedBy'] OR [@type='isManagedBy'] OR registryObject@Group | |
serviceType | Text | The type of service being offered, e.g. veterans' benefits, emergency relief, etc. | Service | service[@type] | |
termsOfService | Text or URL | Human-readable terms of service documentation. | Service | collection/rights/licence[@rightsURI] AND/OR collection/rights/licence[@type] AND/OR collection/rights/licence AND/OR collection/rights/accessRights AND/OR collection/rights/accessRights[@rightsURI] | |
alternateName | Text | An alias for the item. | Thing | service/name[@type='alternative'] OR service/name[@type='abbreviated'] | |
description | Text | A description of the item. | Thing | service/description[@type='brief'] OR service/description[@type='full'] | |
identifier | PropertyValue or Text or URL | The identifier property represents any kind of identifier for any kind of Thing, such as ISBNs, GTIN codes, UUIDs etc. Schema.org provides dedicated properties for representing many of these, either as textual strings or as URL (URI) links. See background notes for more details. | Thing | service/identifier | - Map to URL type where possible. Based on RDA resolvable identifier rules. - Where resolvable identifier link is not possible, map as PropertyValue type. Identifier type + identifier value. E.g. "identifier": {"@type": "PropertyValue", "propertyID": "OCoLC","value": "889647468"}, |
name | Text | The name of the item. | Thing | service/name (Title as displayed in RDA) | |
url | URL | URL of the item. | Thing | RDA key= URL of record. "https://researchdata.edu.au/view?key=<insert key>" | |
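
As an illustration of how a few Table 3 rows combine, the Python sketch below assembles a Schema.org "Service" object and applies the identifier rule (URL where resolvable, otherwise PropertyValue). The resolver prefixes stand in for the RDA resolvable identifier rules, which are not reproduced in this document, so treat `RESOLVERS`, `map_identifier` and `service_to_schema` as hypothetical names, and the `@context` value as an assumption about the output serialisation.

```python
from typing import List, Tuple, Union

RDA_VIEW_URL = "https://researchdata.edu.au/view?key="

# Assumed stand-in for the RDA resolvable identifier rules.
RESOLVERS = {
    "doi": "https://doi.org/",
    "handle": "https://hdl.handle.net/",
}


def map_identifier(id_type: str, value: str) -> Union[str, dict]:
    """Return a URL when the identifier can be resolved, else a PropertyValue."""
    if value.startswith(("http://", "https://")):
        return value  # already a resolvable link
    prefix = RESOLVERS.get(id_type.lower())
    if prefix:
        return prefix + value
    # Not resolvable: keep identifier type + identifier value.
    return {"@type": "PropertyValue", "propertyID": id_type, "value": value}


def service_to_schema(
    name: str,
    service_type: str,
    rda_key: str,
    identifiers: List[Tuple[str, str]],
) -> dict:
    """Build a minimal Schema.org Service from selected RIF-CS service fields."""
    return {
        "@context": "https://schema.org",
        "@type": "Service",
        "name": name,                    # service/name
        "serviceType": service_type,     # service[@type]
        "url": RDA_VIEW_URL + rda_key,   # RDA key= URL of record
        "identifier": [map_identifier(t, v) for t, v in identifiers],
    }
```

Calling `map_identifier("OCoLC", "889647468")` returns the PropertyValue form shown in the identifier mapping note, because no resolver is registered for that identifier type in this sketch.
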