DataManagedObject, Version, KeyValue and Codespace

DataManagedObject, together with Version and Codespace, provides uniform, common properties for all NeTEx entities. DataManagedObject is the abstract “upper object” of the NeTEx implementation and all first class NeTEx entities are specialisations of it.

Figure 1 — Versions and Codespaces (physical model) (UML)

In Transmodel, an ENTITY represents an actual object in a repository having an instance of data present in an exchanged data set. Several versions, usually successive, of an ENTITY can be defined; normally a given data set will contain one particular version (but it is also possible to have multiple versions present in the same data set if required).

Data exported to an XML document from a repository represents a snapshot of the state of a particular version of the data at a particular moment in time. The NeTEx XSD is therefore primarily concerned with instances of Transmodel's ENTITY IN VERSION. That is to say, data elements in a NeTEx XML document represent a particular version of each ENTITY. Even if a basic implementation of a repository holds only a representation of the current state of an ENTITY, rather than its entire history of versions, every time it makes an export it is in effect creating an ENTITY IN VERSION for the time of export; if the current version in the repository is changed, two successive exports would result in different states, i.e. different instances of the appropriate ENTITY IN VERSION.

In NeTEx, ENTITY IN VERSION is implemented as EntityInVersion; this is further specialised as DataManagedObject which also reifies certain other separate Transmodel concepts into a single abstract XML class. It provides a convenient way to bring together common versioning, responsibility, and validity condition features from Transmodel, together with the XML extension mechanisms supported by NeTEx, in a way that is completely uniform for all NeTEx objects and so is simpler to implement.

NeTEx uses Codespaces it to ensure that the identifiers of instances of elements in an XML document are unique, even if they come from many different sources.

A ValidityCondition is used to attach conditions to elements to indicate when they are current or the circumstances in which they should be used (e.g. “In winter”). VALIDITY CONDITIONs can be attached to specific elements and also, through VERSION FRAMEs (see below), to whole sets of objects, so that it is possible to be precise about the conditions governing the coherence and relevance of data.

The DataManagedObject allows the assignment of a Version and responsibility attributes to an EntityInVersion and also one or more instances of a ValidityCondition.

EntityInVersion (Abstract)

A specific version of an ENTITY, as found in a NeTEx document. The NeTEx EntityInVersion implements ENTITY IN VERSION by adding VERSION tracking attributes and VALIDITY CONDITIONs (see later below) for any NeTEx element.

EntityInVersion – XML Element (Abstract)

Classification

Name

Type

Cardinality

Description

::>

::>

Entity

::>

ENTITY ON VERSION inherits from ENTITY.

«FK»

dataSourceRef

DataSourceIdType

0:1

Data system which originated data instance.

«atr»

created

xsd:dateTime

0:1

Date and time of creation of ENTITY.

«atr»

changed

xsd:dateTime

0:1

Date and time of last change to ENTITY.

«enum»

modification

ModificationEnum

0:1

Nature of modification. Enumerated value:

·        new

·        revise

·        delete

·        unchanged

·delta

If unspecified, assume value from parent of containing frame.

«PK»

version

VersionIdType

0:1

VERSION number of this instance of the ENTITY IN VERSION.

«enum»

status

VersionStatusEnum

0:1

Status of ENTITY in VERSION. Default is ‘active’.

Only ‘active’ is supportd in EPIP

Enumerated value: active | inactive | other.

«FK»

derivedFromVersionRef

VersionIdType

0:1

Reference to VERSION from which this VERSION of the ENTITY was derived.

This is a VersionIdType, the same type as version, thus being a version number.

«FK»

compatibleWithVersion­FrameVersionRef

VersionIdType

0:1

Version of frame with which this version of ENTITY is compatible. Assumes a VERSION FRAME of the same id as current frame.

«FK»

derivedFromObjectRef

ObjectIdType

0:1

Identity of object from which this version of object was derived. Normally the same.

 

 

CHOICE

0:1

Optimisation: A single VALID BETWEEN may be used without a wrapper tag.

«cntd»

a

validityConditions

ValidityCondition

0:*

VALIDITY CONDITIONs conditioning entity.

 

b

ValidBetween

ValidBetween

0:1

OPTIMISATION. Simple version of a VALIDITY CONDITION. Comprises a simple period. NO UNIQUENESS CONSTRAINT

 

DataManagedObject (Abstract)

A DataManagedObject is a subclass of EntityInVersion that implements the common properties of all first-class NeTEx elements, such as:

A ResponsibilitySet, specifying data management and stakeholder roles for the object.

A KeyList: an optional set of an arbitrary list of key/value pairs to provide a simple extensibility mechanism. KeyList will not be used in the European profile (however it remains available for local extensions).

An Extensions tag that allows the association of arbitrary user defined XML elements with an element. Extensions shall not be used within the European profile.

A reference to a Branding for the ENTITY to attribute a marketing content for presenting the element.

AlternativeText a set of alternative variants of any textual attribute of the ENTITY (the main purpose being translations into other national language).

 

DataManagedObject is an implementation artefact and does not appear in Transmodel.

DataManagedObject – XML Element (Abstract)

Classification

Name

Type

Cardinality

Description

::>

::>

EntityInVersion

::>

DATA MANAGED OBJECT inherits from ENTITY IN VERSION.

«FK»

responsibilitySetRef

ResponsibilitySetIdType

1:1

Reference to a RESPONSIBILITY SET defining ownership and management responsibilities for objects.

«cntd»

keyList

KeyValue

0:*

A list of KeyValue pairs that may be associated with the object. This allows simple arbitrary user extensions.

«cntd»

Extensions

ExtensionStructure

0:1

User defined extension elements associated with elements.

«FK»

BrandingRef

BrandingRef

0:1

Reference to a BRANDING. (A TypeOfValue) with no added attributes

«cntd»

alternativeTexts

AlternativeText

0:*

Additional Translations of text elements.

 

KeyValue (Subcomponent of DataManagedObject)

A KeyValue is a convenient way to exchange additional specific data attributes not supported in NeTEx. Lists of values can be associated with any DataManagedObject. Note that the interpretation and use of each KeyValue requires an additional specification of its meaning by the local systems that use them.

In EPIP, extensions to elements will be based only on the Key/Value mechanism (i.e. the Extensions tag will not be used). Furthermore, KeyList shall not be used to carry information that can be encode elsewhere in NeTEx.

KeyValue – XML Element

Classification

Name

Type

Cardinality

Description

«PK»

typeOfKey

xsd:normalizedString

0:1

Type for KEY VALUE.

«PK»

Key

xsd:normalizedString

1:1

Key for KEY VALUE.

 

Value

xsd:normalizedString

1:1

Value for KEY VALUE.

Example from the Austrian NeTEx Profile:

Version

NeTEx can support the exchange of data from systems with advanced versioning systems that track every change to every element - as recorded by a version attribute on DataManagedObject. However, many legacy systems do not support such versioning capabilities and so the provision of fine-grained tracking data in a NeTEx document is optional in EPIP. However frame level versioning is mandatory in EPIP.

In Transmodel, VERSIONs are themselves first class objects and can be given descriptions that explain the purpose and validity of the VERSION. These can be exchanged in NeTEx as Version instances.

In EPIP the Version element itself not populated; a simple version number in the version attribute of an EPIP element is used to indicate the version.

Codespace

A CODESPACE is similar to the XML concept of a Namespace that gives a context within which the names of elements and attributes in an XML schema shall unique, (using the generic W3C domain mechanism), thus permitting the combining of elements from different schemas in the same aggregated model. In a similar way, NeTEx uses Codespaces to ensure that the identifiers of instances of elements in an XML document are unique, even if they come from many different systems and are placed in the same document – see examples below. This allows data elements from different countries, different regions, and different organisations to be combined without clashes.

Having an explicit Codespace object also makes it possible for applications (such as a profile validator) to reflect on and compute over the elements within Codespaces in a generic manner (i.e. they can determine which coding system is being used for a given object and use this knowledge to inform the way they process data). The Codespace is thus used for three purposes:

(i)      To give a context within which to identify objects uniquely.

(ii)    To indicate to consumer systems which identifier system to use to interpret codes (if they care).

(iii)   To manage the distributed allocation of new identifiers to new instances of objects.

The NeTEx schema includes XML uniqueness and referential integrity constraints that (a) check each identifier is unique and (b) that any versioned reference to an entity is satisfied (i.e. that the referenced element is also present elsewhere in the document). The checks can be applied automatically using any XML validator to any NeTEx document and are fundamental for establishing a basic quality level for data exchange. See section 8.15.

In the context of EPIP, CODESPACEs prevent identifier clashes among data provisions coming from different countries (and regions, authorities and operators within countries. Different code structures may be used within different CODESPACES – see 8.4-Codespaces and the structure of identifiers for further discussion.

In order to manage a uniform coding system for use by many stakeholders, it may be desirable to partition a CODESPACE into different ADMINISTRATIVE ZONEs, based on spatial, modal or other criteria, each with a responsible ORGANISATION. Within the CODESPACE, the use of a specific prefix or prefix range value may then be reserved for the allocation of identifiers to new entities falling under the criteria of that zone. This allows the issuing of new codes to be delegated to the managers of the zone on a distributed basis. The central registrar function is needed only to setting up of the zones and issue the prefixes in the first place. ADMINISTRATIVE ZONES are not exchanged directly in the strict EPIP, but can be referenced with an external reference from a ResponsibilitySet (see 5.1.10)

A Codespace is specified as a path expression by using an internet domain, obtained through IANA (the Internet Assigned Numbers Authority), which provides a process for registering a domain as globally unique. For example, ‘http://tfl.gov.uk ’, ‘http://bahn.de ’, ‘http://ratp.fr ’, ‘http://foo.com , or ‘http://sbb.ch . The path can be declared in the <XmlnsUrl> attribute of a Codespace element.

Each Codespace has an associated prefix which may be used as a surrogate for the codespace throughout a document and is declared on a VersionFrame using a Codespace element. These prefixes provide an efficient way of encoding unique identifiers in an XML document in that the short prefix can establish a unique context for each identifier on every element id in a document (rather than say, a long qualifying string or complex code structure), even if the identifiers come from heterogenous coding systems. The prefix does not have to be the same as the id of the Codespace (For example, the examples in this document mostly use a prefix of ‘epd’ for the ‘epip_data’ Codespace and a prefix of ‘epip’ for the ‘epip_metadata’ Codespace. Technically the prefix is local to the document and different prefixes can be used for the same Codespace in different documents, but it is helpful to data consumers to use the same prefix values consistently.

Within a codespace, a data provider may use any system of coding identifiers they like; the only requirement is to provide persistent unique identifiers for each separate entity instance within the codespace.

XML EXAMPLE   CODESPACE declarations:

               <CompositeFrame version="1.0" id="mybus:Line1">

                      <codespaces>

                              <Codespace id="napt">

                                     <Xmlns>napt</Xmlns>

                                     <XmlnsUrl>http://naptan.org.uk/naptan</XmlnsUrl>

                                     <Description>UK NaPTAN</Description>

                              </Codespace>

                              <Codespace id="era_stops">

                                     <Xmlns>era</Xmlns>

                                     <XmlnsUrl>http://era.org.eu/stops</XmlnsUrl>

                                     <Description>European Rail Authority</Description>

                              </Codespace>

                      </codespaces>

Examples of elements that use of the above CODESPACEs:

               <StopPlace id=”napt:4701234567”>

               <StopPlaceRef ref=”napt:4701234567”>

               <StopPlace id=”era:0018”>, etc.

 

An application processing a document is expected to understand any necessary rules peculiar to validating or interpreting the identifiers in a specific Codespace. For example, NaPTAN stop identifiers in the UK NAPTAN codespace have the structure ‘999 0 XXXXXXXXX’ where ‘999’ is an area prefix, ‘0’ is fixed and ‘XXXXXXXXX’ is a number unique within the area, whose length and nature depends on the mode of transport. An XML validator will not enforce the format rules, but an implementation that is aware of the CODESPACE and its rule system (as indicated by the domain url) is able to do so programmatically if it is relevant to do so (for example a system that is merely aggregating data to pass it onto others might be agnostic about the structure of identifiers). For the general purposes of emitting or parsing a NeTEx document, any such local rules are generally irrelevant; all that matters is that each entity instance can be uniquely identified.

Codespace – XML Element

Classification

Name

Type

Cardinality

Description

::>

::>

Entity

::>

CODESPACE inherits from ENTITY.

«PK»

id

CodespaceIdType

1:1

Identifier of CODESPACE. Unique within documents.

«AK»

Xmlns

xsd:NMTOKEN

1:1

CODESPACE prefix, unique within a given XML document e.g. 'napt'

 

XmlnsUrl

xsd:anyURI

0:1

CODESPACE path. Globally unique. For example,

http://naptan.org.uk/naptan or http:/vdv.de/vdv/haltstelle/

 

Description

xsd:string

0:1

Description of CODESPACE.

Example from the Austrian NeTEx Profile:

 

Related pages