OpenMinTeD

Documentation references should be versioned

abstract

final

recommended

Machine-readable metadata for UIMA components

concrete

draft

mandatory

WG1, WG4 (44)

Embedding UIMA component metadata into the source code

concrete

draft

mandatory

WG1, WG4 (44)

Separating UIMA metadata from the component

concrete

draft

mandatory

WG1, WG4 (44)

Specifying input and output types of UIMA components

concrete

draft

mandatory

WG1, WG4 (44)

Documentation of UIMA components

concrete

draft

mandatory

WG1, WG4 (44)

Embedding GATE component metadata into the source code

concrete

draft

mandatory

WG1, WG4 (44)

Documentation of GATE components

concrete

draft

mandatory

WG1, WG4 (44)

Separating GATE metadata from the component

concrete

draft

mandatory

WG1, WG4 (44)

Embedding output format in UIMA component metadata

concrete

draft

mandatory

WG1, WG4 (44)

Version documentation in parallel with component/resource

concrete

draft

recommended

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

concrete

draft

mandatory

WG1, WG4 (44)

Encoding citable publications (for scholarly attribution) in resource metadata records

concrete

draft

recommended

WG1, WG2 (27), WG4 (44)

Including license text in resource packages

concrete

draft

recommended

WG1, WG3 (25), WG4 (44)

Unique identifiers and versions for components using Maven

concrete

draft

mandatory

WG1, WG4 (44)

Encoding in the metadata a direct access link for content resources

concrete

draft

mandatory

WG1

Providing access to content resources (sharing/exposing and transferring)

concrete

draft

mandatory

WG1

Making models and annotation resources accessible as entities distinct from the components they are compatible with

concrete

draft

recommended

WG1, WG2 (27), WG4 (44)

Adding version information in the metadata descriptions of all resources

concrete

draft

mandatory

Specifying access mode of resources and encoding it in the metadata descriptions

concrete

draft

mandatory

WG1, WG2 (27), WG4 (44)

Encoding funding information in the metadata descriptions of all resources

concrete

draft

recommended

WG1, WG2 (27), WG4 (44)

Encoding of format in the metadata description of content resources

concrete

draft

mandatory

WG1

Encoding licensing terms in the metadata description of the resource

concrete

draft

mandatory

WG1, WG3 (25)

Encoding metadata on domain/subject/ classification for all resources when applicable

concrete

draft

recommended

WG1, WG2 (27)

Encoding language information in the metadata of content resources

concrete

draft

mandatory

WG1, WG2 (27)

Encoding statistical information in the content resources

concrete

draft

mandatory

WG1, WG2 (27)

Assigning a unique persistent identifier for all resources

concrete

draft

mandatory

WG1, WG2 (27), WG3 (25)

WG2 (27)

ID	Requirement	Concreteness	Status	Strength	WG’s
4	URL to actual content must be discoverable	abstract	final	mandatory	WG1 (40), WG2, WG3 (25)
10	Components should specify the types of the annotations that they input and output	abstract	draft	mandatory	WG4 (44), WG2
36	Classification metadata should be included, where applicable, in the metadata record of the resource	abstract	final	recommended	WG1 (40), WG2
38	Access mode of resources must be included in the metadata	abstract	final	mandatory	WG1 (40), WG2, WG4 (44)
41	Content resources must include metadata on their language(s)	abstract	final	mandatory	WG1 (40), WG2
44	Statistical metadata that allow monitoring of resource versions may accompany resources	abstract	final	optional	WG1 (40), WG2
47	Information on funding of resources may be included in the metadata	abstract	final	optional	WG1 (40), WG2, WG3 (25), WG4 (44)
50	Documentation references should be versioned	abstract	final	recommended	WG1 (40), WG2, WG3 (25), WG4 (44)
67	Knowledge Resource Element Id	abstract	final	recommended	WG2
68	Data Category Linking Vocabulary	abstract	final	recommended	WG2
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	abstract	final	recommended	WG2
70	All KR content elements need to be added as text annotations within a TDM workflow.	abstract	final	mandatory	WG2
71	The KR should be ingestible through a URI	abstract	final	recommended	WG2
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	abstract	final	recommended	WG2
89	Version documentation in parallel with component/resource	concrete	draft	recommended	WG1 (40), WG2, WG3 (25), WG4 (44)
91	Encoding citable publications (for scholarly attribution) in resource metadata records	concrete	draft	recommended	WG1 (40), WG2, WG4 (44)
93	Provide identifiers for knowledge resource elements	concrete	draft	recommended	WG2
94	Data Category Linking Vocabulary	concrete	draft	recommended	WG2
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	concrete	draft	recommended	WG2
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	concrete	draft	recommended	WG1 (40), WG2, WG4 (44)
102	Adding version information in the metadata descriptions of all resources	concrete	draft	mandatory	WG1 (40), WG2, WG3 (25), WG4 (44)
103	Specifying access mode of resources and encoding it in the metadata descriptions	concrete	draft	mandatory	WG1 (40), WG2, WG4 (44)
104	Encoding funding information in the metadata descriptions of all resources	concrete	draft	recommended	WG1 (40), WG2, WG4 (44)
107	Encoding metadata on domain/subject/ classification for all resources when applicable	concrete	draft	recommended	WG1 (40), WG2
108	Encoding language information in the metadata of content resources	concrete	draft	mandatory	WG1 (40), WG2
109	Encoding statistical information in the content resources	concrete	draft	mandatory	WG1 (40), WG2
110	Assigning a unique persistent identifier for all resources	concrete	draft	mandatory	WG1 (40), WG2, WG3 (25)

Requirement

Concreteness

Status

Strength

WG’s

URL to actual content must be discoverable

abstract

final

mandatory

WG1 (40), WG2, WG3 (25)

Components should specify the types of the annotations that they input and output

abstract

draft

mandatory

WG4 (44), WG2

Classification metadata should be included, where applicable, in the metadata record of the resource

abstract

final

recommended

WG1 (40), WG2

Access mode of resources must be included in the metadata

abstract

final

mandatory

WG1 (40), WG2, WG4 (44)

Content resources must include metadata on their language(s)

abstract

final

mandatory

WG1 (40), WG2

Statistical metadata that allow monitoring of resource versions may accompany resources

abstract

final

optional

WG1 (40), WG2

Information on funding of resources may be included in the metadata

abstract

final

optional

Documentation references should be versioned

abstract

final

recommended

Knowledge Resource Element Id

abstract

final

recommended

WG2

Data Category Linking Vocabulary

abstract

final

recommended

WG2

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

abstract

final

recommended

WG2

All KR content elements need to be added as text annotations within a TDM workflow.

abstract

final

mandatory

WG2

The KR should be ingestible through a URI

abstract

final

recommended

WG2

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

abstract

final

recommended

WG2

Version documentation in parallel with component/resource

concrete

draft

recommended

Encoding citable publications (for scholarly attribution) in resource metadata records

concrete

draft

recommended

WG1 (40), WG2, WG4 (44)

Provide identifiers for knowledge resource elements

concrete

draft

recommended

WG2

Data Category Linking Vocabulary

concrete

draft

recommended

WG2

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

concrete

draft

recommended

WG2

Making models and annotation resources accessible as entities distinct from the components they are compatible with

concrete

draft

recommended

WG1 (40), WG2, WG4 (44)

Adding version information in the metadata descriptions of all resources

concrete

draft

mandatory

Specifying access mode of resources and encoding it in the metadata descriptions

concrete

draft

mandatory

WG1 (40), WG2, WG4 (44)

Encoding funding information in the metadata descriptions of all resources

concrete

draft

recommended

WG1 (40), WG2, WG4 (44)

Encoding metadata on domain/subject/ classification for all resources when applicable

concrete

draft

recommended

WG1 (40), WG2

Encoding language information in the metadata of content resources

concrete

draft

mandatory

WG1 (40), WG2

Encoding statistical information in the content resources

concrete

draft

mandatory

WG1 (40), WG2

Assigning a unique persistent identifier for all resources

concrete

draft

mandatory

WG1 (40), WG2, WG3 (25)

WG3 (25)

ID	Requirement	Concreteness	Status	Strength	WG’s
4	URL to actual content must be discoverable	abstract	final	mandatory	WG1 (40), WG2 (27), WG3
33	Licensing information must be included in the metadata	abstract	final	mandatory	WG1 (40), WG3
34	Licensing information should be expressed in a machine-readable form	abstract	final	recommended	WG1 (40), WG3
47	Information on funding of resources may be included in the metadata	abstract	final	optional	WG1 (40), WG2 (27), WG3, WG4 (44)
50	Documentation references should be versioned	abstract	final	recommended	WG1 (40), WG2 (27), WG3, WG4 (44)
51	License should be attached	abstract	draft	recommended	WG3
53	Licensor must be entitled to grant license	abstract	draft	recommended	WG3
54	Licensees should remain with a copy of the license	abstract	draft	recommended	WG3
55	Standard licenses should be used	abstract	draft	recommended	WG3
56	License should be machine readable	abstract	draft	recommended	WG3
57	License should be understandable by non-lawyers	abstract	draft	recommended	WG3
58	TDM must be explicitly allowed	abstract	draft	recommended	WG3
59	Right for (temporary) reproduction must be granted	abstract	draft	recommended	WG3
60	Boundary for derivative work must be clearly defined	abstract	draft	recommended	WG3
61	No restrictions on TDM results which are not derived works	abstract	draft	recommended	WG3
62	World-wide and irrevocable license grant	abstract	draft	recommended	WG3
63	License must qualify for Open Access rights	abstract	draft	recommended	WG3
64	License must qualify for Open Access uses	abstract	draft	recommended	WG3
65	License must qualify for Open Access must not restrict use in any way	abstract	draft	recommended	WG3
66	License must qualify for Open Access may include attribution requirements	abstract	draft	recommended	WG3
89	Version documentation in parallel with component/resource	concrete	draft	recommended	WG1 (40), WG2 (27), WG3, WG4 (44)
92	Including license text in resource packages	concrete	draft	recommended	WG1 (40), WG3, WG4 (44)
102	Adding version information in the metadata descriptions of all resources	concrete	draft	mandatory	WG1 (40), WG2 (27), WG3, WG4 (44)
106	Encoding licensing terms in the metadata description of the resource	concrete	draft	mandatory	WG1 (40), WG3
110	Assigning a unique persistent identifier for all resources	concrete	draft	mandatory	WG1 (40), WG2 (27), WG3

Requirement

Concreteness

Status

Strength

WG’s

URL to actual content must be discoverable

abstract

final

mandatory

WG1 (40), WG2 (27), WG3

Licensing information must be included in the metadata

abstract

final

mandatory

WG1 (40), WG3

Licensing information should be expressed in a machine-readable form

abstract

final

recommended

WG1 (40), WG3

Information on funding of resources may be included in the metadata

abstract

final

optional

Documentation references should be versioned

abstract

final

recommended

License should be attached

abstract

draft

recommended

WG3

Licensor must be entitled to grant license

abstract

draft

recommended

WG3

Licensees should remain with a copy of the license

abstract

draft

recommended

WG3

Standard licenses should be used

abstract

draft

recommended

WG3

License should be machine readable

abstract

draft

recommended

WG3

License should be understandable by non-lawyers

abstract

draft

recommended

WG3

TDM must be explicitly allowed

abstract

draft

recommended

WG3

Right for (temporary) reproduction must be granted

abstract

draft

recommended

WG3

Boundary for derivative work must be clearly defined

abstract

draft

recommended

WG3

No restrictions on TDM results which are not derived works

abstract

draft

recommended

WG3

World-wide and irrevocable license grant

abstract

draft

recommended

WG3

License must qualify for Open Access rights

abstract

draft

recommended

WG3

License must qualify for Open Access uses

abstract

draft

recommended

WG3

License must qualify for Open Access must not restrict use in any way

abstract

draft

recommended

WG3

License must qualify for Open Access may include attribution requirements

abstract

draft

recommended

WG3

Version documentation in parallel with component/resource

concrete

draft

recommended

Including license text in resource packages

concrete

draft

recommended

WG1 (40), WG3, WG4 (44)

Adding version information in the metadata descriptions of all resources

concrete

draft

mandatory

Encoding licensing terms in the metadata description of the resource

concrete

draft

mandatory

WG1 (40), WG3

Assigning a unique persistent identifier for all resources

concrete

draft

mandatory

WG1 (40), WG2 (27), WG3

WG4 (44)

ID	Requirement	Concreteness	Status	Strength	WG’s
1	Components must be described by machine-readable metadata	abstract	final	mandatory	WG4
2	Component metadata have to be embedded into the component source code	abstract	final	mandatory	WG4
3	Component metadata must be separable from the component	abstract	final	mandatory	WG4
5	Components must detail all their environmental requirements for execution	abstract	draft	mandatory	WG4
6	Components should have a unique identifier and a version number	abstract	draft	mandatory	WG4
7	Components must have a fully qualified name that follows the Java class naming conventions	concrete	final	mandatory	WG4
8	Components must associate themselves with categories defined by the OpenMinTeD project	abstract	final	mandatory	WG4
9	Components must declare their annotation schema dependencies	abstract	final	mandatory	WG4
10	Components should specify the types of the annotations that they input and output	abstract	draft	mandatory	WG4, WG2 (27)
11	Components must declare whether they can be scaled within a workflow	abstract	draft	mandatory	WG4
12	Components should provide documentation describing their functionality	abstract	final	recommended	WG4
13	Citation information for component should be included in the metadata	abstract	draft	recommended	WG1 (40), WG4
16	Models/resources should be useable across different component collections/platforms	abstract	final	recommended	WG4
17	Components should be stateless	concrete	final	recommended	WG4
21	Configuration and parametrizable options of the components should be identified and documented	abstract	final	recommended	WG4
26	It should be possible to determine the source of an annotation/assigned category	abstract	final	recommended	WG4
28	Processing components should be downloadable	abstract	final	recommended	WG4
38	Access mode of resources must be included in the metadata	abstract	final	mandatory	WG1 (40), WG2 (27), WG4
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	abstract	final	mandatory	WG1 (40), WG4
45	S/W (tools, web services, workflows) must indicate format of their output	abstract	final	mandatory	WG1 (40), WG4
47	Information on funding of resources may be included in the metadata	abstract	final	optional	WG1 (40), WG2 (27), WG3 (25), WG4
50	Documentation references should be versioned	abstract	final	recommended	WG1 (40), WG2 (27), WG3 (25), WG4
73	Stick to widely used data compression formats	concrete	draft	best practice	WG4
74	Machine-readable metadata for UIMA components	concrete	draft	mandatory	WG1 (40), WG4
75	Embedding UIMA component metadata into the source code	concrete	draft	mandatory	WG1 (40), WG4
76	Separating UIMA metadata from the component	concrete	draft	mandatory	WG1 (40), WG4
78	Specifying input and output types of UIMA components	concrete	draft	mandatory	WG1 (40), WG4
79	Documentation of UIMA components	concrete	draft	mandatory	WG1 (40), WG4
81	Embedding GATE component metadata into the source code	concrete	draft	mandatory	WG1 (40), WG4
83	Documentation of GATE components	concrete	draft	mandatory	WG1 (40), WG4
84	Separating GATE metadata from the component	concrete	draft	mandatory	WG1 (40), WG4
88	Embedding output format in UIMA component metadata	concrete	draft	mandatory	WG1 (40), WG4
89	Version documentation in parallel with component/resource	concrete	draft	recommended	WG1 (40), WG2 (27), WG3 (25), WG4
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	concrete	draft	mandatory	WG1 (40), WG4
91	Encoding citable publications (for scholarly attribution) in resource metadata records	concrete	draft	recommended	WG1 (40), WG2 (27), WG4
92	Including license text in resource packages	concrete	draft	recommended	WG1 (40), WG3 (25), WG4
96	Unique identifiers and versions for components using Maven	concrete	draft	mandatory	WG1 (40), WG4
97	Declaring scaleout capability in UIMA	concrete	draft	mandatory	WG4
98	Publishing components via software repositories (Maven, Docker)	concrete	draft	mandatory	WG4
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	concrete	draft	recommended	WG1 (40), WG2 (27), WG4
102	Adding version information in the metadata descriptions of all resources	concrete	draft	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4
103	Specifying access mode of resources and encoding it in the metadata descriptions	concrete	draft	mandatory	WG1 (40), WG2 (27), WG4
104	Encoding funding information in the metadata descriptions of all resources	concrete	draft	recommended	WG1 (40), WG2 (27), WG4
111	Annotation schema dependencies for UIMA components using Maven	concrete	draft	mandatory	WG4

Requirement

Concreteness

Status

Strength

WG’s

Components must be described by machine-readable metadata

abstract

final

mandatory

WG4

Component metadata have to be embedded into the component source code

abstract

final

mandatory

WG4

Component metadata must be separable from the component

abstract

final

mandatory

WG4

Components must detail all their environmental requirements for execution

abstract

draft

mandatory

WG4

Components should have a unique identifier and a version number

abstract

draft

mandatory

WG4

Components must have a fully qualified name that follows the Java class naming conventions

concrete

final

mandatory

WG4

Components must associate themselves with categories defined by the OpenMinTeD project

abstract

final

mandatory

WG4

Components must declare their annotation schema dependencies

abstract

final

mandatory

WG4

Components should specify the types of the annotations that they input and output

abstract

draft

mandatory

WG4, WG2 (27)

Components must declare whether they can be scaled within a workflow

abstract

draft

mandatory

WG4

Components should provide documentation describing their functionality

abstract

final

recommended

WG4

Citation information for component should be included in the metadata

abstract

draft

recommended

WG1 (40), WG4

Models/resources should be useable across different component collections/platforms

abstract

final

recommended

WG4

Components should be stateless

concrete

final

recommended

WG4

Configuration and parametrizable options of the components should be identified and documented

abstract

final

recommended

WG4

It should be possible to determine the source of an annotation/assigned category

abstract

final

recommended

WG4

Processing components should be downloadable

abstract

final

recommended

WG4

Access mode of resources must be included in the metadata

abstract

final

mandatory

WG1 (40), WG2 (27), WG4

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

abstract

final

mandatory

WG1 (40), WG4

S/W (tools, web services, workflows) must indicate format of their output

abstract

final

mandatory

WG1 (40), WG4

Information on funding of resources may be included in the metadata

abstract

final

optional

Documentation references should be versioned

abstract

final

recommended

Stick to widely used data compression formats

concrete

draft

best practice

WG4

Machine-readable metadata for UIMA components

concrete

draft

mandatory

WG1 (40), WG4

Embedding UIMA component metadata into the source code

concrete

draft

mandatory

WG1 (40), WG4

Separating UIMA metadata from the component

concrete

draft

mandatory

WG1 (40), WG4

Specifying input and output types of UIMA components

concrete

draft

mandatory

WG1 (40), WG4

Documentation of UIMA components

concrete

draft

mandatory

WG1 (40), WG4

Embedding GATE component metadata into the source code

concrete

draft

mandatory

WG1 (40), WG4

Documentation of GATE components

concrete

draft

mandatory

WG1 (40), WG4

Separating GATE metadata from the component

concrete

draft

mandatory

WG1 (40), WG4

Embedding output format in UIMA component metadata

concrete

draft

mandatory

WG1 (40), WG4

Version documentation in parallel with component/resource

concrete

draft

recommended

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

concrete

draft

mandatory

WG1 (40), WG4

Encoding citable publications (for scholarly attribution) in resource metadata records

concrete

draft

recommended

WG1 (40), WG2 (27), WG4

Including license text in resource packages

concrete

draft

recommended

WG1 (40), WG3 (25), WG4

Unique identifiers and versions for components using Maven

concrete

draft

mandatory

WG1 (40), WG4

Declaring scaleout capability in UIMA

concrete

draft

mandatory

WG4

Publishing components via software repositories (Maven, Docker)

concrete

draft

mandatory

WG4

Making models and annotation resources accessible as entities distinct from the components they are compatible with

concrete

draft

recommended

WG1 (40), WG2 (27), WG4

Adding version information in the metadata descriptions of all resources

concrete

draft

mandatory

Specifying access mode of resources and encoding it in the metadata descriptions

concrete

draft

mandatory

WG1 (40), WG2 (27), WG4

Encoding funding information in the metadata descriptions of all resources

concrete

draft

recommended

WG1 (40), WG2 (27), WG4

Annotation schema dependencies for UIMA components using Maven

concrete

draft

mandatory

WG4

By Status

deprecated (27)

ID	Requirement	Concreteness	Strength	WG’s
14	Components must maintain License information	abstract	mandatory	WG4 (44)
15	Human readable information should be provided by each resource	abstract	recommended	WG1 (40), WG4 (44)
18	Workflows should be described using an uniform language	abstract	recommended	WG4 (44)
19	Components that use external knowledge resources should delegate access to a resource adapter instead of handling it themselves	abstract	optional	WG2 (27), WG4 (44)
20	Workflow engines should not require to see data	concrete	recommended	WG2 (27), WG4 (44)
22	The Workflow Engine Should Permit Saving Experimental Conditions in a Workflow	abstract	recommended	WG1 (40), WG4 (44)
23	The Workflow Engine should permit Licence Aggregation in Workflows	abstract	recommended	WG3 (25), WG4 (44)
24	Using/treating workflows as components	abstract	mandatory	WG4 (44)
25	Incorporation of multiple resources in parallel	abstract	recommended	WG4 (44)
27	Components should handle failures gracefully	abstract	recommended	WG4 (44)
29	The actual content of all content resources must be discoverable	abstract	mandatory	WG1 (40), WG2 (27), WG3 (25)
30	Metrics for the confidence level of the TDM operation should be included in the metadata	abstract	optional	WG1 (40), WG4 (44)
31	Metrics for the performance of the TDM operation should be included in the metadata	abstract	optional	WG1 (40), WG4 (44)
32	Version must be included in the metadata description for all resources	abstract	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
35	All resources must include a unique persistent identifier	abstract	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
40	Component metadata must include standardised categories/tags that make them easy to discover	abstract	mandatory	WG1 (40), WG4 (44)
42	The metadata can include the information on which projects/workflows involve the resource	abstract	optional	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
46	Output resources of web services/workflows must be accompanied by provenance metadata	abstract	mandatory	WG1 (40), WG4 (44)
48	All resource metadata records must include a reference to the metadata schema used for their description	abstract	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
49	Metadata of tools should contain information about the models available for them	abstract	recommended	WG1 (40), WG4 (44)
52	License information must be in metadata	abstract	recommended	WG1 (40), WG3 (25)
77	Unique identifiers and versions for UIMA components	concrete	mandatory	WG1 (40), WG4 (44)
80	Common elements to represent/describe an executable workflow	concrete	recommended	WG4 (44)
82	Unique identifiers and versions for GATE components	concrete	mandatory	WG1 (40), WG4 (44)
85	Unique identifier and version for components in the OMTD-SHARE schema	concrete	mandatory	WG1 (40), WG4 (44)
86	Attaching format properties to the description of the inputs and outputs that use file	concrete	recommended	WG1 (40), WG4 (44)
87	Embedding language capability in UIMA component metadata	concrete	mandatory	WG1 (40), WG4 (44)

Requirement

Concreteness

Strength

WG’s

Components must maintain License information

abstract

mandatory

Human readable information should be provided by each resource

abstract

recommended

Workflows should be described using an uniform language

abstract

recommended

Components that use external knowledge resources should delegate access to a resource adapter instead of handling it themselves

abstract

optional

Workflow engines should not require to see data

concrete

recommended

The Workflow Engine Should Permit Saving Experimental Conditions in a Workflow

abstract

recommended

The Workflow Engine should permit Licence Aggregation in Workflows

abstract

recommended

WG3 (25), WG4 (44)

Using/treating workflows as components

abstract

mandatory

Incorporation of multiple resources in parallel

abstract

recommended

Components should handle failures gracefully

abstract

recommended

The actual content of all content resources must be discoverable

abstract

mandatory

Metrics for the confidence level of the TDM operation should be included in the metadata

abstract

optional

Metrics for the performance of the TDM operation should be included in the metadata

abstract

optional

Version must be included in the metadata description for all resources

abstract

mandatory

All resources must include a unique persistent identifier

abstract

mandatory

Component metadata must include standardised categories/tags that make them easy to discover

abstract

mandatory

The metadata can include the information on which projects/workflows involve the resource

abstract

optional

Output resources of web services/workflows must be accompanied by provenance metadata

abstract

mandatory

All resource metadata records must include a reference to the metadata schema used for their description

abstract

mandatory

Metadata of tools should contain information about the models available for them

abstract

recommended

License information must be in metadata

abstract

recommended

Unique identifiers and versions for UIMA components

concrete

mandatory

Common elements to represent/describe an executable workflow

concrete

recommended

Unique identifiers and versions for GATE components

concrete

mandatory

Unique identifier and version for components in the OMTD-SHARE schema

concrete

mandatory

Attaching format properties to the description of the inputs and outputs that use file

concrete

recommended

Embedding language capability in UIMA component metadata

concrete

mandatory

draft (53)

ID	Requirement	Concreteness	Strength	WG’s
5	Components must detail all their environmental requirements for execution	abstract	mandatory	WG4 (44)
6	Components should have a unique identifier and a version number	abstract	mandatory	WG4 (44)
10	Components should specify the types of the annotations that they input and output	abstract	mandatory	WG4 (44), WG2 (27)
11	Components must declare whether they can be scaled within a workflow	abstract	mandatory	WG4 (44)
13	Citation information for component should be included in the metadata	abstract	recommended	WG1 (40), WG4 (44)
51	License should be attached	abstract	recommended	WG3 (25)
53	Licensor must be entitled to grant license	abstract	recommended	WG3 (25)
54	Licensees should remain with a copy of the license	abstract	recommended	WG3 (25)
55	Standard licenses should be used	abstract	recommended	WG3 (25)
56	License should be machine readable	abstract	recommended	WG3 (25)
57	License should be understandable by non-lawyers	abstract	recommended	WG3 (25)
58	TDM must be explicitly allowed	abstract	recommended	WG3 (25)
59	Right for (temporary) reproduction must be granted	abstract	recommended	WG3 (25)
60	Boundary for derivative work must be clearly defined	abstract	recommended	WG3 (25)
61	No restrictions on TDM results which are not derived works	abstract	recommended	WG3 (25)
62	World-wide and irrevocable license grant	abstract	recommended	WG3 (25)
63	License must qualify for Open Access rights	abstract	recommended	WG3 (25)
64	License must qualify for Open Access uses	abstract	recommended	WG3 (25)
65	License must qualify for Open Access must not restrict use in any way	abstract	recommended	WG3 (25)
66	License must qualify for Open Access may include attribution requirements	abstract	recommended	WG3 (25)
73	Stick to widely used data compression formats	concrete	best practice	WG4 (44)
74	Machine-readable metadata for UIMA components	concrete	mandatory	WG1 (40), WG4 (44)
75	Embedding UIMA component metadata into the source code	concrete	mandatory	WG1 (40), WG4 (44)
76	Separating UIMA metadata from the component	concrete	mandatory	WG1 (40), WG4 (44)
78	Specifying input and output types of UIMA components	concrete	mandatory	WG1 (40), WG4 (44)
79	Documentation of UIMA components	concrete	mandatory	WG1 (40), WG4 (44)
81	Embedding GATE component metadata into the source code	concrete	mandatory	WG1 (40), WG4 (44)
83	Documentation of GATE components	concrete	mandatory	WG1 (40), WG4 (44)
84	Separating GATE metadata from the component	concrete	mandatory	WG1 (40), WG4 (44)
88	Embedding output format in UIMA component metadata	concrete	mandatory	WG1 (40), WG4 (44)
89	Version documentation in parallel with component/resource	concrete	recommended	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	concrete	mandatory	WG1 (40), WG4 (44)
91	Encoding citable publications (for scholarly attribution) in resource metadata records	concrete	recommended	WG1 (40), WG2 (27), WG4 (44)
92	Including license text in resource packages	concrete	recommended	WG1 (40), WG3 (25), WG4 (44)
93	Provide identifiers for knowledge resource elements	concrete	recommended	WG2 (27)
94	Data Category Linking Vocabulary	concrete	recommended	WG2 (27)
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	concrete	recommended	WG2 (27)
96	Unique identifiers and versions for components using Maven	concrete	mandatory	WG1 (40), WG4 (44)
97	Declaring scaleout capability in UIMA	concrete	mandatory	WG4 (44)
98	Publishing components via software repositories (Maven, Docker)	concrete	mandatory	WG4 (44)
99	Encoding in the metadata a direct access link for content resources	concrete	mandatory	WG1 (40)
100	Providing access to content resources (sharing/exposing and transferring)	concrete	mandatory	WG1 (40)
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	concrete	recommended	WG1 (40), WG2 (27), WG4 (44)
102	Adding version information in the metadata descriptions of all resources	concrete	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
103	Specifying access mode of resources and encoding it in the metadata descriptions	concrete	mandatory	WG1 (40), WG2 (27), WG4 (44)
104	Encoding funding information in the metadata descriptions of all resources	concrete	recommended	WG1 (40), WG2 (27), WG4 (44)
105	Encoding of format in the metadata description of content resources	concrete	mandatory	WG1 (40)
106	Encoding licensing terms in the metadata description of the resource	concrete	mandatory	WG1 (40), WG3 (25)
107	Encoding metadata on domain/subject/ classification for all resources when applicable	concrete	recommended	WG1 (40), WG2 (27)
108	Encoding language information in the metadata of content resources	concrete	mandatory	WG1 (40), WG2 (27)
109	Encoding statistical information in the content resources	concrete	mandatory	WG1 (40), WG2 (27)
110	Assigning a unique persistent identifier for all resources	concrete	mandatory	WG1 (40), WG2 (27), WG3 (25)
111	Annotation schema dependencies for UIMA components using Maven	concrete	mandatory	WG4 (44)

Requirement

Concreteness

Strength

WG’s

Components must detail all their environmental requirements for execution

abstract

mandatory

Components should have a unique identifier and a version number

abstract

mandatory

Components should specify the types of the annotations that they input and output

abstract

mandatory

WG4 (44), WG2 (27)

Components must declare whether they can be scaled within a workflow

abstract

mandatory

Citation information for component should be included in the metadata

abstract

recommended

License should be attached

abstract

recommended

Licensor must be entitled to grant license

abstract

recommended

Licensees should remain with a copy of the license

abstract

recommended

Standard licenses should be used

abstract

recommended

License should be machine readable

abstract

recommended

License should be understandable by non-lawyers

abstract

recommended

TDM must be explicitly allowed

abstract

recommended

Right for (temporary) reproduction must be granted

abstract

recommended

Boundary for derivative work must be clearly defined

abstract

recommended

No restrictions on TDM results which are not derived works

abstract

recommended

World-wide and irrevocable license grant

abstract

recommended

License must qualify for Open Access rights

abstract

recommended

License must qualify for Open Access uses

abstract

recommended

License must qualify for Open Access must not restrict use in any way

abstract

recommended

License must qualify for Open Access may include attribution requirements

abstract

recommended

Stick to widely used data compression formats

concrete

best practice

Machine-readable metadata for UIMA components

concrete

mandatory

Embedding UIMA component metadata into the source code

concrete

mandatory

Separating UIMA metadata from the component

concrete

mandatory

Specifying input and output types of UIMA components

concrete

mandatory

Documentation of UIMA components

concrete

mandatory

Embedding GATE component metadata into the source code

concrete

mandatory

Documentation of GATE components

concrete

mandatory

Separating GATE metadata from the component

concrete

mandatory

Embedding output format in UIMA component metadata

concrete

mandatory

Version documentation in parallel with component/resource

concrete

recommended

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

concrete

mandatory

Encoding citable publications (for scholarly attribution) in resource metadata records

concrete

recommended

Including license text in resource packages

concrete

recommended

WG1 (40), WG3 (25), WG4 (44)

Provide identifiers for knowledge resource elements

concrete

recommended

Data Category Linking Vocabulary

concrete

recommended

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

concrete

recommended

Unique identifiers and versions for components using Maven

concrete

mandatory

Declaring scaleout capability in UIMA

concrete

mandatory

Publishing components via software repositories (Maven, Docker)

concrete

mandatory

Encoding in the metadata a direct access link for content resources

concrete

mandatory

Providing access to content resources (sharing/exposing and transferring)

concrete

mandatory

Making models and annotation resources accessible as entities distinct from the components they are compatible with

concrete

recommended

Adding version information in the metadata descriptions of all resources

concrete

mandatory

Specifying access mode of resources and encoding it in the metadata descriptions

concrete

mandatory

Encoding funding information in the metadata descriptions of all resources

concrete

recommended

Encoding of format in the metadata description of content resources

concrete

mandatory

Encoding licensing terms in the metadata description of the resource

concrete

mandatory

Encoding metadata on domain/subject/ classification for all resources when applicable

concrete

recommended

Encoding language information in the metadata of content resources

concrete

mandatory

Encoding statistical information in the content resources

concrete

mandatory

Assigning a unique persistent identifier for all resources

concrete

mandatory

Annotation schema dependencies for UIMA components using Maven

concrete

mandatory

final (31)

ID	Requirement	Concreteness	Strength	WG’s
1	Components must be described by machine-readable metadata	abstract	mandatory	WG4 (44)
2	Component metadata have to be embedded into the component source code	abstract	mandatory	WG4 (44)
3	Component metadata must be separable from the component	abstract	mandatory	WG4 (44)
4	URL to actual content must be discoverable	abstract	mandatory	WG1 (40), WG2 (27), WG3 (25)
7	Components must have a fully qualified name that follows the Java class naming conventions	concrete	mandatory	WG4 (44)
8	Components must associate themselves with categories defined by the OpenMinTeD project	abstract	mandatory	WG4 (44)
9	Components must declare their annotation schema dependencies	abstract	mandatory	WG4 (44)
12	Components should provide documentation describing their functionality	abstract	recommended	WG4 (44)
16	Models/resources should be useable across different component collections/platforms	abstract	recommended	WG4 (44)
17	Components should be stateless	concrete	recommended	WG4 (44)
21	Configuration and parametrizable options of the components should be identified and documented	abstract	recommended	WG4 (44)
26	It should be possible to determine the source of an annotation/assigned category	abstract	recommended	WG4 (44)
28	Processing components should be downloadable	abstract	recommended	WG4 (44)
33	Licensing information must be included in the metadata	abstract	mandatory	WG1 (40), WG3 (25)
34	Licensing information should be expressed in a machine-readable form	abstract	recommended	WG1 (40), WG3 (25)
36	Classification metadata should be included, where applicable, in the metadata record of the resource	abstract	recommended	WG1 (40), WG2 (27)
37	Information on the structural annotation (layout) of resources should be included in the metadata of the resource	abstract	recommended	WG1 (40)
38	Access mode of resources must be included in the metadata	abstract	mandatory	WG1 (40), WG2 (27), WG4 (44)
39	Content resources must include metadata on their format (e.g. XML, DOCX etc.)	abstract	mandatory	WG1 (40)
41	Content resources must include metadata on their language(s)	abstract	mandatory	WG1 (40), WG2 (27)
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	abstract	mandatory	WG1 (40), WG4 (44)
44	Statistical metadata that allow monitoring of resource versions may accompany resources	abstract	optional	WG1 (40), WG2 (27)
45	S/W (tools, web services, workflows) must indicate format of their output	abstract	mandatory	WG1 (40), WG4 (44)
47	Information on funding of resources may be included in the metadata	abstract	optional	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
50	Documentation references should be versioned	abstract	recommended	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
67	Knowledge Resource Element Id	abstract	recommended	WG2 (27)
68	Data Category Linking Vocabulary	abstract	recommended	WG2 (27)
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	abstract	recommended	WG2 (27)
70	All KR content elements need to be added as text annotations within a TDM workflow.	abstract	mandatory	WG2 (27)
71	The KR should be ingestible through a URI	abstract	recommended	WG2 (27)
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	abstract	recommended	WG2 (27)

Requirement

Concreteness

Strength

WG’s

Components must be described by machine-readable metadata

abstract

mandatory

Component metadata have to be embedded into the component source code

abstract

mandatory

Component metadata must be separable from the component

abstract

mandatory

URL to actual content must be discoverable

abstract

mandatory

Components must have a fully qualified name that follows the Java class naming conventions

concrete

mandatory

Components must associate themselves with categories defined by the OpenMinTeD project

abstract

mandatory

Components must declare their annotation schema dependencies

abstract

mandatory

Components should provide documentation describing their functionality

abstract

recommended

Models/resources should be useable across different component collections/platforms

abstract

recommended

Components should be stateless

concrete

recommended

Configuration and parametrizable options of the components should be identified and documented

abstract

recommended

It should be possible to determine the source of an annotation/assigned category

abstract

recommended

Processing components should be downloadable

abstract

recommended

Licensing information must be included in the metadata

abstract

mandatory

Licensing information should be expressed in a machine-readable form

abstract

recommended

Classification metadata should be included, where applicable, in the metadata record of the resource

abstract

recommended

Information on the structural annotation (layout) of resources should be included in the metadata of the resource

abstract

recommended

Access mode of resources must be included in the metadata

abstract

mandatory

Content resources must include metadata on their format (e.g. XML, DOCX etc.)

abstract

mandatory

Content resources must include metadata on their language(s)

abstract

mandatory

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

abstract

mandatory

Statistical metadata that allow monitoring of resource versions may accompany resources

abstract

optional

S/W (tools, web services, workflows) must indicate format of their output

abstract

mandatory

Information on funding of resources may be included in the metadata

abstract

optional

Documentation references should be versioned

abstract

recommended

Knowledge Resource Element Id

abstract

recommended

Data Category Linking Vocabulary

abstract

recommended

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

abstract

recommended

All KR content elements need to be added as text annotations within a TDM workflow.

abstract

mandatory

The KR should be ingestible through a URI

abstract

recommended

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

abstract

recommended

By Strength

[[STR-best practice]] === best practice (1)

ID	Requirement	Concreteness	Status	WG’s
73	Stick to widely used data compression formats	concrete	draft	WG4 (44)

Requirement

Concreteness

Status

WG’s

Stick to widely used data compression formats

concrete

draft

mandatory (53)

ID	Requirement	Concreteness	Status	WG’s
1	Components must be described by machine-readable metadata	abstract	final	WG4 (44)
2	Component metadata have to be embedded into the component source code	abstract	final	WG4 (44)
3	Component metadata must be separable from the component	abstract	final	WG4 (44)
4	URL to actual content must be discoverable	abstract	final	WG1 (40), WG2 (27), WG3 (25)
5	Components must detail all their environmental requirements for execution	abstract	draft	WG4 (44)
6	Components should have a unique identifier and a version number	abstract	draft	WG4 (44)
7	Components must have a fully qualified name that follows the Java class naming conventions	concrete	final	WG4 (44)
8	Components must associate themselves with categories defined by the OpenMinTeD project	abstract	final	WG4 (44)
9	Components must declare their annotation schema dependencies	abstract	final	WG4 (44)
10	Components should specify the types of the annotations that they input and output	abstract	draft	WG4 (44), WG2 (27)
11	Components must declare whether they can be scaled within a workflow	abstract	draft	WG4 (44)
14	Components must maintain License information	abstract	deprecated	WG4 (44)
24	Using/treating workflows as components	abstract	deprecated	WG4 (44)
29	The actual content of all content resources must be discoverable	abstract	deprecated	WG1 (40), WG2 (27), WG3 (25)
32	Version must be included in the metadata description for all resources	abstract	deprecated	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
33	Licensing information must be included in the metadata	abstract	final	WG1 (40), WG3 (25)
35	All resources must include a unique persistent identifier	abstract	deprecated	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
38	Access mode of resources must be included in the metadata	abstract	final	WG1 (40), WG2 (27), WG4 (44)
39	Content resources must include metadata on their format (e.g. XML, DOCX etc.)	abstract	final	WG1 (40)
40	Component metadata must include standardised categories/tags that make them easy to discover	abstract	deprecated	WG1 (40), WG4 (44)
41	Content resources must include metadata on their language(s)	abstract	final	WG1 (40), WG2 (27)
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	abstract	final	WG1 (40), WG4 (44)
45	S/W (tools, web services, workflows) must indicate format of their output	abstract	final	WG1 (40), WG4 (44)
46	Output resources of web services/workflows must be accompanied by provenance metadata	abstract	deprecated	WG1 (40), WG4 (44)
48	All resource metadata records must include a reference to the metadata schema used for their description	abstract	deprecated	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
70	All KR content elements need to be added as text annotations within a TDM workflow.	abstract	final	WG2 (27)
74	Machine-readable metadata for UIMA components	concrete	draft	WG1 (40), WG4 (44)
75	Embedding UIMA component metadata into the source code	concrete	draft	WG1 (40), WG4 (44)
76	Separating UIMA metadata from the component	concrete	draft	WG1 (40), WG4 (44)
77	Unique identifiers and versions for UIMA components	concrete	deprecated	WG1 (40), WG4 (44)
78	Specifying input and output types of UIMA components	concrete	draft	WG1 (40), WG4 (44)
79	Documentation of UIMA components	concrete	draft	WG1 (40), WG4 (44)
81	Embedding GATE component metadata into the source code	concrete	draft	WG1 (40), WG4 (44)
82	Unique identifiers and versions for GATE components	concrete	deprecated	WG1 (40), WG4 (44)
83	Documentation of GATE components	concrete	draft	WG1 (40), WG4 (44)
84	Separating GATE metadata from the component	concrete	draft	WG1 (40), WG4 (44)
85	Unique identifier and version for components in the OMTD-SHARE schema	concrete	deprecated	WG1 (40), WG4 (44)
87	Embedding language capability in UIMA component metadata	concrete	deprecated	WG1 (40), WG4 (44)
88	Embedding output format in UIMA component metadata	concrete	draft	WG1 (40), WG4 (44)
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	concrete	draft	WG1 (40), WG4 (44)
96	Unique identifiers and versions for components using Maven	concrete	draft	WG1 (40), WG4 (44)
97	Declaring scaleout capability in UIMA	concrete	draft	WG4 (44)
98	Publishing components via software repositories (Maven, Docker)	concrete	draft	WG4 (44)
99	Encoding in the metadata a direct access link for content resources	concrete	draft	WG1 (40)
100	Providing access to content resources (sharing/exposing and transferring)	concrete	draft	WG1 (40)
102	Adding version information in the metadata descriptions of all resources	concrete	draft	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
103	Specifying access mode of resources and encoding it in the metadata descriptions	concrete	draft	WG1 (40), WG2 (27), WG4 (44)
105	Encoding of format in the metadata description of content resources	concrete	draft	WG1 (40)
106	Encoding licensing terms in the metadata description of the resource	concrete	draft	WG1 (40), WG3 (25)
108	Encoding language information in the metadata of content resources	concrete	draft	WG1 (40), WG2 (27)
109	Encoding statistical information in the content resources	concrete	draft	WG1 (40), WG2 (27)
110	Assigning a unique persistent identifier for all resources	concrete	draft	WG1 (40), WG2 (27), WG3 (25)
111	Annotation schema dependencies for UIMA components using Maven	concrete	draft	WG4 (44)

Requirement

Concreteness

Status

WG’s

Components must be described by machine-readable metadata

abstract

final

Component metadata have to be embedded into the component source code

abstract

final

Component metadata must be separable from the component

abstract

final

URL to actual content must be discoverable

abstract

final

Components must detail all their environmental requirements for execution

abstract

draft

Components should have a unique identifier and a version number

abstract

draft

Components must have a fully qualified name that follows the Java class naming conventions

concrete

final

Components must associate themselves with categories defined by the OpenMinTeD project

abstract

final

Components must declare their annotation schema dependencies

abstract

final

Components should specify the types of the annotations that they input and output

abstract

draft

WG4 (44), WG2 (27)

Components must declare whether they can be scaled within a workflow

abstract

draft

Components must maintain License information

abstract

deprecated

Using/treating workflows as components

abstract

deprecated

The actual content of all content resources must be discoverable

abstract

deprecated

Version must be included in the metadata description for all resources

abstract

deprecated

Licensing information must be included in the metadata

abstract

final

All resources must include a unique persistent identifier

abstract

deprecated

Access mode of resources must be included in the metadata

abstract

final

Content resources must include metadata on their format (e.g. XML, DOCX etc.)

abstract

final

Component metadata must include standardised categories/tags that make them easy to discover

abstract

deprecated

Content resources must include metadata on their language(s)

abstract

final

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

abstract

final

S/W (tools, web services, workflows) must indicate format of their output

abstract

final

Output resources of web services/workflows must be accompanied by provenance metadata

abstract

deprecated

All resource metadata records must include a reference to the metadata schema used for their description

abstract

deprecated

All KR content elements need to be added as text annotations within a TDM workflow.

abstract

final

Machine-readable metadata for UIMA components

concrete

draft

Embedding UIMA component metadata into the source code

concrete

draft

Separating UIMA metadata from the component

concrete

draft

Unique identifiers and versions for UIMA components

concrete

deprecated

Specifying input and output types of UIMA components

concrete

draft

Documentation of UIMA components

concrete

draft

Embedding GATE component metadata into the source code

concrete

draft

Unique identifiers and versions for GATE components

concrete

deprecated

Documentation of GATE components

concrete

draft

Separating GATE metadata from the component

concrete

draft

Unique identifier and version for components in the OMTD-SHARE schema

concrete

deprecated

Embedding language capability in UIMA component metadata

concrete

deprecated

Embedding output format in UIMA component metadata

concrete

draft

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

concrete

draft

Unique identifiers and versions for components using Maven

concrete

draft

Declaring scaleout capability in UIMA

concrete

draft

Publishing components via software repositories (Maven, Docker)

concrete

draft

Encoding in the metadata a direct access link for content resources

concrete

draft

Providing access to content resources (sharing/exposing and transferring)

concrete

draft

Adding version information in the metadata descriptions of all resources

concrete

draft

Specifying access mode of resources and encoding it in the metadata descriptions

concrete

draft

Encoding of format in the metadata description of content resources

concrete

draft

Encoding licensing terms in the metadata description of the resource

concrete

draft

Encoding language information in the metadata of content resources

concrete

draft

Encoding statistical information in the content resources

concrete

draft

Assigning a unique persistent identifier for all resources

concrete

draft

Annotation schema dependencies for UIMA components using Maven

concrete

draft

optional (6)

ID	Requirement	Concreteness	Status	WG’s
19	Components that use external knowledge resources should delegate access to a resource adapter instead of handling it themselves	abstract	deprecated	WG2 (27), WG4 (44)
30	Metrics for the confidence level of the TDM operation should be included in the metadata	abstract	deprecated	WG1 (40), WG4 (44)
31	Metrics for the performance of the TDM operation should be included in the metadata	abstract	deprecated	WG1 (40), WG4 (44)
42	The metadata can include the information on which projects/workflows involve the resource	abstract	deprecated	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
44	Statistical metadata that allow monitoring of resource versions may accompany resources	abstract	final	WG1 (40), WG2 (27)
47	Information on funding of resources may be included in the metadata	abstract	final	WG1 (40), WG2 (27), WG3 (25), WG4 (44)

Requirement

Concreteness

Status

WG’s

Components that use external knowledge resources should delegate access to a resource adapter instead of handling it themselves

abstract

deprecated

Metrics for the confidence level of the TDM operation should be included in the metadata

abstract

deprecated

Metrics for the performance of the TDM operation should be included in the metadata

abstract

deprecated

The metadata can include the information on which projects/workflows involve the resource

abstract

deprecated

Statistical metadata that allow monitoring of resource versions may accompany resources

abstract

final

Information on funding of resources may be included in the metadata

abstract

final

recommended (51)

ID	Requirement	Concreteness	Status	WG’s
12	Components should provide documentation describing their functionality	abstract	final	WG4 (44)
13	Citation information for component should be included in the metadata	abstract	draft	WG1 (40), WG4 (44)
15	Human readable information should be provided by each resource	abstract	deprecated	WG1 (40), WG4 (44)
16	Models/resources should be useable across different component collections/platforms	abstract	final	WG4 (44)
17	Components should be stateless	concrete	final	WG4 (44)
18	Workflows should be described using an uniform language	abstract	deprecated	WG4 (44)
20	Workflow engines should not require to see data	concrete	deprecated	WG2 (27), WG4 (44)
21	Configuration and parametrizable options of the components should be identified and documented	abstract	final	WG4 (44)
22	The Workflow Engine Should Permit Saving Experimental Conditions in a Workflow	abstract	deprecated	WG1 (40), WG4 (44)
23	The Workflow Engine should permit Licence Aggregation in Workflows	abstract	deprecated	WG3 (25), WG4 (44)
25	Incorporation of multiple resources in parallel	abstract	deprecated	WG4 (44)
26	It should be possible to determine the source of an annotation/assigned category	abstract	final	WG4 (44)
27	Components should handle failures gracefully	abstract	deprecated	WG4 (44)
28	Processing components should be downloadable	abstract	final	WG4 (44)
34	Licensing information should be expressed in a machine-readable form	abstract	final	WG1 (40), WG3 (25)
36	Classification metadata should be included, where applicable, in the metadata record of the resource	abstract	final	WG1 (40), WG2 (27)
37	Information on the structural annotation (layout) of resources should be included in the metadata of the resource	abstract	final	WG1 (40)
49	Metadata of tools should contain information about the models available for them	abstract	deprecated	WG1 (40), WG4 (44)
50	Documentation references should be versioned	abstract	final	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
51	License should be attached	abstract	draft	WG3 (25)
52	License information must be in metadata	abstract	deprecated	WG1 (40), WG3 (25)
53	Licensor must be entitled to grant license	abstract	draft	WG3 (25)
54	Licensees should remain with a copy of the license	abstract	draft	WG3 (25)
55	Standard licenses should be used	abstract	draft	WG3 (25)
56	License should be machine readable	abstract	draft	WG3 (25)
57	License should be understandable by non-lawyers	abstract	draft	WG3 (25)
58	TDM must be explicitly allowed	abstract	draft	WG3 (25)
59	Right for (temporary) reproduction must be granted	abstract	draft	WG3 (25)
60	Boundary for derivative work must be clearly defined	abstract	draft	WG3 (25)
61	No restrictions on TDM results which are not derived works	abstract	draft	WG3 (25)
62	World-wide and irrevocable license grant	abstract	draft	WG3 (25)
63	License must qualify for Open Access rights	abstract	draft	WG3 (25)
64	License must qualify for Open Access uses	abstract	draft	WG3 (25)
65	License must qualify for Open Access must not restrict use in any way	abstract	draft	WG3 (25)
66	License must qualify for Open Access may include attribution requirements	abstract	draft	WG3 (25)
67	Knowledge Resource Element Id	abstract	final	WG2 (27)
68	Data Category Linking Vocabulary	abstract	final	WG2 (27)
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	abstract	final	WG2 (27)
71	The KR should be ingestible through a URI	abstract	final	WG2 (27)
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	abstract	final	WG2 (27)
80	Common elements to represent/describe an executable workflow	concrete	deprecated	WG4 (44)
86	Attaching format properties to the description of the inputs and outputs that use file	concrete	deprecated	WG1 (40), WG4 (44)
89	Version documentation in parallel with component/resource	concrete	draft	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
91	Encoding citable publications (for scholarly attribution) in resource metadata records	concrete	draft	WG1 (40), WG2 (27), WG4 (44)
92	Including license text in resource packages	concrete	draft	WG1 (40), WG3 (25), WG4 (44)
93	Provide identifiers for knowledge resource elements	concrete	draft	WG2 (27)
94	Data Category Linking Vocabulary	concrete	draft	WG2 (27)
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	concrete	draft	WG2 (27)
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	concrete	draft	WG1 (40), WG2 (27), WG4 (44)
104	Encoding funding information in the metadata descriptions of all resources	concrete	draft	WG1 (40), WG2 (27), WG4 (44)
107	Encoding metadata on domain/subject/ classification for all resources when applicable	concrete	draft	WG1 (40), WG2 (27)

Requirement

Concreteness

Status

WG’s

Components should provide documentation describing their functionality

abstract

final

Citation information for component should be included in the metadata

abstract

draft

Human readable information should be provided by each resource

abstract

deprecated

Models/resources should be useable across different component collections/platforms

abstract

final

Components should be stateless

concrete

final

Workflows should be described using an uniform language

abstract

deprecated

Workflow engines should not require to see data

concrete

deprecated

Configuration and parametrizable options of the components should be identified and documented

abstract

final

The Workflow Engine Should Permit Saving Experimental Conditions in a Workflow

abstract

deprecated

The Workflow Engine should permit Licence Aggregation in Workflows

abstract

deprecated

WG3 (25), WG4 (44)

Incorporation of multiple resources in parallel

abstract

deprecated

It should be possible to determine the source of an annotation/assigned category

abstract

final

Components should handle failures gracefully

abstract

deprecated

Processing components should be downloadable

abstract

final

Licensing information should be expressed in a machine-readable form

abstract

final

Classification metadata should be included, where applicable, in the metadata record of the resource

abstract

final

Information on the structural annotation (layout) of resources should be included in the metadata of the resource

abstract

final

Metadata of tools should contain information about the models available for them

abstract

deprecated

Documentation references should be versioned

abstract

final

License should be attached

abstract

draft

License information must be in metadata

abstract

deprecated

Licensor must be entitled to grant license

abstract

draft

Licensees should remain with a copy of the license

abstract

draft

Standard licenses should be used

abstract

draft

License should be machine readable

abstract

draft

License should be understandable by non-lawyers

abstract

draft

TDM must be explicitly allowed

abstract

draft

Right for (temporary) reproduction must be granted

abstract

draft

Boundary for derivative work must be clearly defined

abstract

draft

No restrictions on TDM results which are not derived works

abstract

draft

World-wide and irrevocable license grant

abstract

draft

License must qualify for Open Access rights

abstract

draft

License must qualify for Open Access uses

abstract

draft

License must qualify for Open Access must not restrict use in any way

abstract

draft

License must qualify for Open Access may include attribution requirements

abstract

draft

Knowledge Resource Element Id

abstract

final

Data Category Linking Vocabulary

abstract

final

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

abstract

final

The KR should be ingestible through a URI

abstract

final

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

abstract

final

Common elements to represent/describe an executable workflow

concrete

deprecated

Attaching format properties to the description of the inputs and outputs that use file

concrete

deprecated

Version documentation in parallel with component/resource

concrete

draft

Encoding citable publications (for scholarly attribution) in resource metadata records

concrete

draft

Including license text in resource packages

concrete

draft

WG1 (40), WG3 (25), WG4 (44)

Provide identifiers for knowledge resource elements

concrete

draft

Data Category Linking Vocabulary

concrete

draft

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

concrete

draft

Making models and annotation resources accessible as entities distinct from the components they are compatible with

concrete

draft

Encoding funding information in the metadata descriptions of all resources

concrete

draft

Encoding metadata on domain/subject/ classification for all resources when applicable

concrete

draft

By Concreteness

abstract (69)

ID	Requirement	Status	Strength	WG’s
1	Components must be described by machine-readable metadata	final	mandatory	WG4 (44)
2	Component metadata have to be embedded into the component source code	final	mandatory	WG4 (44)
3	Component metadata must be separable from the component	final	mandatory	WG4 (44)
4	URL to actual content must be discoverable	final	mandatory	WG1 (40), WG2 (27), WG3 (25)
5	Components must detail all their environmental requirements for execution	draft	mandatory	WG4 (44)
6	Components should have a unique identifier and a version number	draft	mandatory	WG4 (44)
8	Components must associate themselves with categories defined by the OpenMinTeD project	final	mandatory	WG4 (44)
9	Components must declare their annotation schema dependencies	final	mandatory	WG4 (44)
10	Components should specify the types of the annotations that they input and output	draft	mandatory	WG4 (44), WG2 (27)
11	Components must declare whether they can be scaled within a workflow	draft	mandatory	WG4 (44)
12	Components should provide documentation describing their functionality	final	recommended	WG4 (44)
13	Citation information for component should be included in the metadata	draft	recommended	WG1 (40), WG4 (44)
14	Components must maintain License information	deprecated	mandatory	WG4 (44)
15	Human readable information should be provided by each resource	deprecated	recommended	WG1 (40), WG4 (44)
16	Models/resources should be useable across different component collections/platforms	final	recommended	WG4 (44)
18	Workflows should be described using an uniform language	deprecated	recommended	WG4 (44)
19	Components that use external knowledge resources should delegate access to a resource adapter instead of handling it themselves	deprecated	optional	WG2 (27), WG4 (44)
21	Configuration and parametrizable options of the components should be identified and documented	final	recommended	WG4 (44)
22	The Workflow Engine Should Permit Saving Experimental Conditions in a Workflow	deprecated	recommended	WG1 (40), WG4 (44)
23	The Workflow Engine should permit Licence Aggregation in Workflows	deprecated	recommended	WG3 (25), WG4 (44)
24	Using/treating workflows as components	deprecated	mandatory	WG4 (44)
25	Incorporation of multiple resources in parallel	deprecated	recommended	WG4 (44)
26	It should be possible to determine the source of an annotation/assigned category	final	recommended	WG4 (44)
27	Components should handle failures gracefully	deprecated	recommended	WG4 (44)
28	Processing components should be downloadable	final	recommended	WG4 (44)
29	The actual content of all content resources must be discoverable	deprecated	mandatory	WG1 (40), WG2 (27), WG3 (25)
30	Metrics for the confidence level of the TDM operation should be included in the metadata	deprecated	optional	WG1 (40), WG4 (44)
31	Metrics for the performance of the TDM operation should be included in the metadata	deprecated	optional	WG1 (40), WG4 (44)
32	Version must be included in the metadata description for all resources	deprecated	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
33	Licensing information must be included in the metadata	final	mandatory	WG1 (40), WG3 (25)
34	Licensing information should be expressed in a machine-readable form	final	recommended	WG1 (40), WG3 (25)
35	All resources must include a unique persistent identifier	deprecated	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
36	Classification metadata should be included, where applicable, in the metadata record of the resource	final	recommended	WG1 (40), WG2 (27)
37	Information on the structural annotation (layout) of resources should be included in the metadata of the resource	final	recommended	WG1 (40)
38	Access mode of resources must be included in the metadata	final	mandatory	WG1 (40), WG2 (27), WG4 (44)
39	Content resources must include metadata on their format (e.g. XML, DOCX etc.)	final	mandatory	WG1 (40)
40	Component metadata must include standardised categories/tags that make them easy to discover	deprecated	mandatory	WG1 (40), WG4 (44)
41	Content resources must include metadata on their language(s)	final	mandatory	WG1 (40), WG2 (27)
42	The metadata can include the information on which projects/workflows involve the resource	deprecated	optional	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	final	mandatory	WG1 (40), WG4 (44)
44	Statistical metadata that allow monitoring of resource versions may accompany resources	final	optional	WG1 (40), WG2 (27)
45	S/W (tools, web services, workflows) must indicate format of their output	final	mandatory	WG1 (40), WG4 (44)
46	Output resources of web services/workflows must be accompanied by provenance metadata	deprecated	mandatory	WG1 (40), WG4 (44)
47	Information on funding of resources may be included in the metadata	final	optional	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
48	All resource metadata records must include a reference to the metadata schema used for their description	deprecated	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
49	Metadata of tools should contain information about the models available for them	deprecated	recommended	WG1 (40), WG4 (44)
50	Documentation references should be versioned	final	recommended	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
51	License should be attached	draft	recommended	WG3 (25)
52	License information must be in metadata	deprecated	recommended	WG1 (40), WG3 (25)
53	Licensor must be entitled to grant license	draft	recommended	WG3 (25)
54	Licensees should remain with a copy of the license	draft	recommended	WG3 (25)
55	Standard licenses should be used	draft	recommended	WG3 (25)
56	License should be machine readable	draft	recommended	WG3 (25)
57	License should be understandable by non-lawyers	draft	recommended	WG3 (25)
58	TDM must be explicitly allowed	draft	recommended	WG3 (25)
59	Right for (temporary) reproduction must be granted	draft	recommended	WG3 (25)
60	Boundary for derivative work must be clearly defined	draft	recommended	WG3 (25)
61	No restrictions on TDM results which are not derived works	draft	recommended	WG3 (25)
62	World-wide and irrevocable license grant	draft	recommended	WG3 (25)
63	License must qualify for Open Access rights	draft	recommended	WG3 (25)
64	License must qualify for Open Access uses	draft	recommended	WG3 (25)
65	License must qualify for Open Access must not restrict use in any way	draft	recommended	WG3 (25)
66	License must qualify for Open Access may include attribution requirements	draft	recommended	WG3 (25)
67	Knowledge Resource Element Id	final	recommended	WG2 (27)
68	Data Category Linking Vocabulary	final	recommended	WG2 (27)
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	final	recommended	WG2 (27)
70	All KR content elements need to be added as text annotations within a TDM workflow.	final	mandatory	WG2 (27)
71	The KR should be ingestible through a URI	final	recommended	WG2 (27)
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	final	recommended	WG2 (27)

Requirement

Status

Strength

WG’s

Components must be described by machine-readable metadata

final

mandatory

Component metadata have to be embedded into the component source code

final

mandatory

Component metadata must be separable from the component

final

mandatory

URL to actual content must be discoverable

final

mandatory

Components must detail all their environmental requirements for execution

draft

mandatory

Components should have a unique identifier and a version number

draft

mandatory

Components must associate themselves with categories defined by the OpenMinTeD project

final

mandatory

Components must declare their annotation schema dependencies

final

mandatory

Components should specify the types of the annotations that they input and output

draft

mandatory

WG4 (44), WG2 (27)

Components must declare whether they can be scaled within a workflow

draft

mandatory

Components should provide documentation describing their functionality

final

recommended

Citation information for component should be included in the metadata

draft

recommended

Components must maintain License information

deprecated

mandatory

Human readable information should be provided by each resource

deprecated

recommended

Models/resources should be useable across different component collections/platforms

final

recommended

Workflows should be described using an uniform language

deprecated

recommended

Components that use external knowledge resources should delegate access to a resource adapter instead of handling it themselves

deprecated

optional

Configuration and parametrizable options of the components should be identified and documented

final

recommended

The Workflow Engine Should Permit Saving Experimental Conditions in a Workflow

deprecated

recommended

The Workflow Engine should permit Licence Aggregation in Workflows

deprecated

recommended

WG3 (25), WG4 (44)

Using/treating workflows as components

deprecated

mandatory

Incorporation of multiple resources in parallel

deprecated

recommended

It should be possible to determine the source of an annotation/assigned category

final

recommended

Components should handle failures gracefully

deprecated

recommended

Processing components should be downloadable

final

recommended

The actual content of all content resources must be discoverable

deprecated

mandatory

Metrics for the confidence level of the TDM operation should be included in the metadata

deprecated

optional

Metrics for the performance of the TDM operation should be included in the metadata

deprecated

optional

Version must be included in the metadata description for all resources

deprecated

mandatory

Licensing information must be included in the metadata

final

mandatory

Licensing information should be expressed in a machine-readable form

final

recommended

All resources must include a unique persistent identifier

deprecated

mandatory

Classification metadata should be included, where applicable, in the metadata record of the resource

final

recommended

Information on the structural annotation (layout) of resources should be included in the metadata of the resource

final

recommended

Access mode of resources must be included in the metadata

final

mandatory

Content resources must include metadata on their format (e.g. XML, DOCX etc.)

final

mandatory

Component metadata must include standardised categories/tags that make them easy to discover

deprecated

mandatory

Content resources must include metadata on their language(s)

final

mandatory

The metadata can include the information on which projects/workflows involve the resource

deprecated

optional

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

final

mandatory

Statistical metadata that allow monitoring of resource versions may accompany resources

final

optional

S/W (tools, web services, workflows) must indicate format of their output

final

mandatory

Output resources of web services/workflows must be accompanied by provenance metadata

deprecated

mandatory

Information on funding of resources may be included in the metadata

final

optional

All resource metadata records must include a reference to the metadata schema used for their description

deprecated

mandatory

Metadata of tools should contain information about the models available for them

deprecated

recommended

Documentation references should be versioned

final

recommended

License should be attached

draft

recommended

License information must be in metadata

deprecated

recommended

Licensor must be entitled to grant license

draft

recommended

Licensees should remain with a copy of the license

draft

recommended

Standard licenses should be used

draft

recommended

License should be machine readable

draft

recommended

License should be understandable by non-lawyers

draft

recommended

TDM must be explicitly allowed

draft

recommended

Right for (temporary) reproduction must be granted

draft

recommended

Boundary for derivative work must be clearly defined

draft

recommended

No restrictions on TDM results which are not derived works

draft

recommended

World-wide and irrevocable license grant

draft

recommended

License must qualify for Open Access rights

draft

recommended

License must qualify for Open Access uses

draft

recommended

License must qualify for Open Access must not restrict use in any way

draft

recommended

License must qualify for Open Access may include attribution requirements

draft

recommended

Knowledge Resource Element Id

final

recommended

Data Category Linking Vocabulary

final

recommended

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

final

recommended

All KR content elements need to be added as text annotations within a TDM workflow.

final

mandatory

The KR should be ingestible through a URI

final

recommended

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

final

recommended

concrete (42)

ID	Requirement	Status	Strength	WG’s
7	Components must have a fully qualified name that follows the Java class naming conventions	final	mandatory	WG4 (44)
17	Components should be stateless	final	recommended	WG4 (44)
20	Workflow engines should not require to see data	deprecated	recommended	WG2 (27), WG4 (44)
73	Stick to widely used data compression formats	draft	best practice	WG4 (44)
74	Machine-readable metadata for UIMA components	draft	mandatory	WG1 (40), WG4 (44)
75	Embedding UIMA component metadata into the source code	draft	mandatory	WG1 (40), WG4 (44)
76	Separating UIMA metadata from the component	draft	mandatory	WG1 (40), WG4 (44)
77	Unique identifiers and versions for UIMA components	deprecated	mandatory	WG1 (40), WG4 (44)
78	Specifying input and output types of UIMA components	draft	mandatory	WG1 (40), WG4 (44)
79	Documentation of UIMA components	draft	mandatory	WG1 (40), WG4 (44)
80	Common elements to represent/describe an executable workflow	deprecated	recommended	WG4 (44)
81	Embedding GATE component metadata into the source code	draft	mandatory	WG1 (40), WG4 (44)
82	Unique identifiers and versions for GATE components	deprecated	mandatory	WG1 (40), WG4 (44)
83	Documentation of GATE components	draft	mandatory	WG1 (40), WG4 (44)
84	Separating GATE metadata from the component	draft	mandatory	WG1 (40), WG4 (44)
85	Unique identifier and version for components in the OMTD-SHARE schema	deprecated	mandatory	WG1 (40), WG4 (44)
86	Attaching format properties to the description of the inputs and outputs that use file	deprecated	recommended	WG1 (40), WG4 (44)
87	Embedding language capability in UIMA component metadata	deprecated	mandatory	WG1 (40), WG4 (44)
88	Embedding output format in UIMA component metadata	draft	mandatory	WG1 (40), WG4 (44)
89	Version documentation in parallel with component/resource	draft	recommended	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	draft	mandatory	WG1 (40), WG4 (44)
91	Encoding citable publications (for scholarly attribution) in resource metadata records	draft	recommended	WG1 (40), WG2 (27), WG4 (44)
92	Including license text in resource packages	draft	recommended	WG1 (40), WG3 (25), WG4 (44)
93	Provide identifiers for knowledge resource elements	draft	recommended	WG2 (27)
94	Data Category Linking Vocabulary	draft	recommended	WG2 (27)
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	draft	recommended	WG2 (27)
96	Unique identifiers and versions for components using Maven	draft	mandatory	WG1 (40), WG4 (44)
97	Declaring scaleout capability in UIMA	draft	mandatory	WG4 (44)
98	Publishing components via software repositories (Maven, Docker)	draft	mandatory	WG4 (44)
99	Encoding in the metadata a direct access link for content resources	draft	mandatory	WG1 (40)
100	Providing access to content resources (sharing/exposing and transferring)	draft	mandatory	WG1 (40)
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	draft	recommended	WG1 (40), WG2 (27), WG4 (44)
102	Adding version information in the metadata descriptions of all resources	draft	mandatory	WG1 (40), WG2 (27), WG3 (25), WG4 (44)
103	Specifying access mode of resources and encoding it in the metadata descriptions	draft	mandatory	WG1 (40), WG2 (27), WG4 (44)
104	Encoding funding information in the metadata descriptions of all resources	draft	recommended	WG1 (40), WG2 (27), WG4 (44)
105	Encoding of format in the metadata description of content resources	draft	mandatory	WG1 (40)
106	Encoding licensing terms in the metadata description of the resource	draft	mandatory	WG1 (40), WG3 (25)
107	Encoding metadata on domain/subject/ classification for all resources when applicable	draft	recommended	WG1 (40), WG2 (27)
108	Encoding language information in the metadata of content resources	draft	mandatory	WG1 (40), WG2 (27)
109	Encoding statistical information in the content resources	draft	mandatory	WG1 (40), WG2 (27)
110	Assigning a unique persistent identifier for all resources	draft	mandatory	WG1 (40), WG2 (27), WG3 (25)
111	Annotation schema dependencies for UIMA components using Maven	draft	mandatory	WG4 (44)

Requirement

Status

Strength

WG’s

Components must have a fully qualified name that follows the Java class naming conventions

final

mandatory

Components should be stateless

final

recommended

Workflow engines should not require to see data

deprecated

recommended

Stick to widely used data compression formats

draft

best practice

Machine-readable metadata for UIMA components

draft

mandatory

Embedding UIMA component metadata into the source code

draft

mandatory

Separating UIMA metadata from the component

draft

mandatory

Unique identifiers and versions for UIMA components

deprecated

mandatory

Specifying input and output types of UIMA components

draft

mandatory

Documentation of UIMA components

draft

mandatory

Common elements to represent/describe an executable workflow

deprecated

recommended

Embedding GATE component metadata into the source code

draft

mandatory

Unique identifiers and versions for GATE components

deprecated

mandatory

Documentation of GATE components

draft

mandatory

Separating GATE metadata from the component

draft

mandatory

Unique identifier and version for components in the OMTD-SHARE schema

deprecated

mandatory

Attaching format properties to the description of the inputs and outputs that use file

deprecated

recommended

Embedding language capability in UIMA component metadata

deprecated

mandatory

Embedding output format in UIMA component metadata

draft

mandatory

Version documentation in parallel with component/resource

draft

recommended

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

draft

mandatory

Encoding citable publications (for scholarly attribution) in resource metadata records

draft

recommended

Including license text in resource packages

draft

recommended

WG1 (40), WG3 (25), WG4 (44)

Provide identifiers for knowledge resource elements

draft

recommended

Data Category Linking Vocabulary

draft

recommended

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

draft

recommended

Unique identifiers and versions for components using Maven

draft

mandatory

Declaring scaleout capability in UIMA

draft

mandatory

Publishing components via software repositories (Maven, Docker)

draft

mandatory

Encoding in the metadata a direct access link for content resources

draft

mandatory

Providing access to content resources (sharing/exposing and transferring)

draft

mandatory

Making models and annotation resources accessible as entities distinct from the components they are compatible with

draft

recommended

Adding version information in the metadata descriptions of all resources

draft

mandatory

Specifying access mode of resources and encoding it in the metadata descriptions

draft

mandatory

Encoding funding information in the metadata descriptions of all resources

draft

recommended

Encoding of format in the metadata description of content resources

draft

mandatory

Encoding licensing terms in the metadata description of the resource

draft

mandatory

Encoding metadata on domain/subject/ classification for all resources when applicable

draft

recommended

Encoding language information in the metadata of content resources

draft

mandatory

Encoding statistical information in the content resources

draft

mandatory

Assigning a unique persistent identifier for all resources

draft

mandatory

Annotation schema dependencies for UIMA components using Maven

draft

mandatory

Components must be described by machine-readable metadata

Compliance

By Product

Numbers exclude deprecated requirements.

ARGO (44)

Compliance	#	%
Full	9	20
No	15	34
Partial	20	45

ID	Requirement	Compliance
1	Components must be described by machine-readable metadata	Full
2	Component metadata have to be embedded into the component source code	No
3	Component metadata must be separable from the component	Partial
5	Components must detail all their environmental requirements for execution	Partial
6	Components should have a unique identifier and a version number	Partial
7	Components must have a fully qualified name that follows the Java class naming conventions	Partial
8	Components must associate themselves with categories defined by the OpenMinTeD project	Partial
9	Components must declare their annotation schema dependencies	Full
10	Components should specify the types of the annotations that they input and output	Partial
11	Components must declare whether they can be scaled within a workflow	Full
12	Components should provide documentation describing their functionality	Partial
13	Citation information for component should be included in the metadata	No
16	Models/resources should be useable across different component collections/platforms	Partial
17	Components should be stateless	Partial
21	Configuration and parametrizable options of the components should be identified and documented	Full
26	It should be possible to determine the source of an annotation/assigned category	No
28	Processing components should be downloadable	No
33	Licensing information must be included in the metadata	Partial
36	Classification metadata should be included, where applicable, in the metadata record of the resource	No
38	Access mode of resources must be included in the metadata	Partial
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	No
45	S/W (tools, web services, workflows) must indicate format of their output	Partial
50	Documentation references should be versioned	No
74	Machine-readable metadata for UIMA components	Partial
75	Embedding UIMA component metadata into the source code	Partial
76	Separating UIMA metadata from the component	Full
78	Specifying input and output types of UIMA components	Partial
79	Documentation of UIMA components	Partial
88	Embedding output format in UIMA component metadata	No
89	Version documentation in parallel with component/resource	No
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
92	Including license text in resource packages	Partial
96	Unique identifiers and versions for components using Maven	Full
97	Declaring scaleout capability in UIMA	Full
98	Publishing components via software repositories (Maven, Docker)	No
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	No
102	Adding version information in the metadata descriptions of all resources	Partial
103	Specifying access mode of resources and encoding it in the metadata descriptions	Partial
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	Partial
107	Encoding metadata on domain/subject/ classification for all resources when applicable	No
110	Assigning a unique persistent identifier for all resources	Full
111	Annotation schema dependencies for UIMA components using Maven	Full

Requirement

Compliance

Full

Component metadata have to be embedded into the component source code

Component metadata must be separable from the component

Partial

Components must detail all their environmental requirements for execution

Partial

Components should have a unique identifier and a version number

Partial

Components must have a fully qualified name that follows the Java class naming conventions

Partial

Components must associate themselves with categories defined by the OpenMinTeD project

Partial

Components must declare their annotation schema dependencies

Full

Components should specify the types of the annotations that they input and output

Partial

Components must declare whether they can be scaled within a workflow

Full

Components should provide documentation describing their functionality

Partial

Citation information for component should be included in the metadata

Models/resources should be useable across different component collections/platforms

Partial

Configuration and parametrizable options of the components should be identified and documented

Partial

Full

It should be possible to determine the source of an annotation/assigned category

Licensing information must be included in the metadata

Partial

Classification metadata should be included, where applicable, in the metadata record of the resource

Access mode of resources must be included in the metadata

Partial

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

S/W (tools, web services, workflows) must indicate format of their output

Partial

Machine-readable metadata for UIMA components

Partial

Embedding UIMA component metadata into the source code

Partial

Separating UIMA metadata from the component

Full

Specifying input and output types of UIMA components

Partial

Documentation of UIMA components

Partial

Embedding output format in UIMA component metadata

Version documentation in parallel with component/resource

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

Encoding citable publications (for scholarly attribution) in resource metadata records

Unique identifiers and versions for components using Maven

Partial

Full

Declaring scaleout capability in UIMA

Full

Publishing components via software repositories (Maven, Docker)

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

102

Adding version information in the metadata descriptions of all resources

Partial

103

Specifying access mode of resources and encoding it in the metadata descriptions

Partial

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

Partial

107

Encoding metadata on domain/subject/ classification for all resources when applicable

110

Assigning a unique persistent identifier for all resources

Full

111

Annotation schema dependencies for UIMA components using Maven

Full

Agrovoc (26)

Compliance	#	%
Full	21	81
No	4	15
Partial	1	4

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Full
33	Licensing information must be included in the metadata	Full
36	Classification metadata should be included, where applicable, in the metadata record of the resource	Full
38	Access mode of resources must be included in the metadata	Full
41	Content resources must include metadata on their language(s)	Full
44	Statistical metadata that allow monitoring of resource versions may accompany resources	Full
50	Documentation references should be versioned	No
67	Knowledge Resource Element Id	Full
68	Data Category Linking Vocabulary	Full
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	Full
71	The KR should be ingestible through a URI	Full
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
89	Version documentation in parallel with component/resource	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
93	Provide identifiers for knowledge resource elements	Full
94	Data Category Linking Vocabulary	Full
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	Full
102	Adding version information in the metadata descriptions of all resources	Partial
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	Full
107	Encoding metadata on domain/subject/ classification for all resources when applicable	Full
108	Encoding language information in the metadata of content resources	Full
109	Encoding statistical information in the content resources	Full
110	Assigning a unique persistent identifier for all resources	Full

Requirement

Compliance

Licensing information must be included in the metadata

Full

Full

Classification metadata should be included, where applicable, in the metadata record of the resource

Full

Access mode of resources must be included in the metadata

Full

Content resources must include metadata on their language(s)

Full

Statistical metadata that allow monitoring of resource versions may accompany resources

Full

Full

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

Full

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

Version documentation in parallel with component/resource

Encoding citable publications (for scholarly attribution) in resource metadata records

Provide identifiers for knowledge resource elements

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

Full

102

Adding version information in the metadata descriptions of all resources

Partial

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

Full

107

Encoding metadata on domain/subject/ classification for all resources when applicable

Full

108

Encoding language information in the metadata of content resources

Full

109

Encoding statistical information in the content resources

Full

110

Assigning a unique persistent identifier for all resources

Full

Alvis (37)

Compliance	#	%
Full	7	19
No	15	41
Partial	15	41

ID	Requirement	Compliance
1	Components must be described by machine-readable metadata	Full
2	Component metadata have to be embedded into the component source code	No
3	Component metadata must be separable from the component	Full
5	Components must detail all their environmental requirements for execution	Partial
6	Components should have a unique identifier and a version number	Partial
7	Components must have a fully qualified name that follows the Java class naming conventions	Full
8	Components must associate themselves with categories defined by the OpenMinTeD project	Partial
9	Components must declare their annotation schema dependencies	No
10	Components should specify the types of the annotations that they input and output	Partial
11	Components must declare whether they can be scaled within a workflow	No
12	Components should provide documentation describing their functionality	Partial
13	Citation information for component should be included in the metadata	No
16	Models/resources should be useable across different component collections/platforms	Partial
17	Components should be stateless	Partial
21	Configuration and parametrizable options of the components should be identified and documented	Full
26	It should be possible to determine the source of an annotation/assigned category	Partial
28	Processing components should be downloadable	Full
33	Licensing information must be included in the metadata	No
36	Classification metadata should be included, where applicable, in the metadata record of the resource	No
38	Access mode of resources must be included in the metadata	No
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	No
45	S/W (tools, web services, workflows) must indicate format of their output	No
50	Documentation references should be versioned	No
89	Version documentation in parallel with component/resource	No
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
92	Including license text in resource packages	Partial
96	Unique identifiers and versions for components using Maven	Partial
98	Publishing components via software repositories (Maven, Docker)	Partial
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	Partial
102	Adding version information in the metadata descriptions of all resources	Partial
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	Partial
107	Encoding metadata on domain/subject/ classification for all resources when applicable	No
110	Assigning a unique persistent identifier for all resources	Full
111	Annotation schema dependencies for UIMA components using Maven	Partial

Requirement

Compliance

Components must be described by machine-readable metadata

Full

Component metadata have to be embedded into the component source code

Component metadata must be separable from the component

Full

Components must detail all their environmental requirements for execution

Partial

Components should have a unique identifier and a version number

Partial

Components must have a fully qualified name that follows the Java class naming conventions

Full

Components must associate themselves with categories defined by the OpenMinTeD project

Partial

Components must declare their annotation schema dependencies

Components should specify the types of the annotations that they input and output

Partial

Components must declare whether they can be scaled within a workflow

Components should provide documentation describing their functionality

Partial

Citation information for component should be included in the metadata

Models/resources should be useable across different component collections/platforms

Partial

Configuration and parametrizable options of the components should be identified and documented

Partial

Full

It should be possible to determine the source of an annotation/assigned category

Partial

Licensing information must be included in the metadata

Full

Classification metadata should be included, where applicable, in the metadata record of the resource

Access mode of resources must be included in the metadata

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

S/W (tools, web services, workflows) must indicate format of their output

Version documentation in parallel with component/resource

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

Encoding citable publications (for scholarly attribution) in resource metadata records

Unique identifiers and versions for components using Maven

Partial

Partial

Publishing components via software repositories (Maven, Docker)

Partial

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

Partial

102

Adding version information in the metadata descriptions of all resources

Partial

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

Partial

107

Encoding metadata on domain/subject/ classification for all resources when applicable

110

Assigning a unique persistent identifier for all resources

Full

111

Annotation schema dependencies for UIMA components using Maven

Partial

CLARIN CCR (8)

Compliance	#	%
Full	4	50
No	4	50

ID	Requirement	Compliance
67	Knowledge Resource Element Id	Full
68	Data Category Linking Vocabulary	No
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	No
71	The KR should be ingestible through a URI	No
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
93	Provide identifiers for knowledge resource elements	Full
94	Data Category Linking Vocabulary	No
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full

Requirement

Compliance

Full

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Provide identifiers for knowledge resource elements

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

CORE (18)

Compliance	#	%
Full	7	39
No	1	6
Partial	10	56

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Partial
33	Licensing information must be included in the metadata	Partial
36	Classification metadata should be included, where applicable, in the metadata record of the resource	Partial
37	Information on the structural annotation (layout) of resources should be included in the metadata of the resource	Partial
38	Access mode of resources must be included in the metadata	Full
39	Content resources must include metadata on their format (e.g. XML, DOCX etc.)	Partial
41	Content resources must include metadata on their language(s)	Partial
99	Encoding in the metadata a direct access link for content resources	Partial
100	Providing access to content resources (sharing/exposing and transferring)	Full
102	Adding version information in the metadata descriptions of all resources	Partial
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
105	Encoding of format in the metadata description of content resources	Full
106	Encoding licensing terms in the metadata description of the resource	Partial
107	Encoding metadata on domain/subject/ classification for all resources when applicable	Partial
108	Encoding language information in the metadata of content resources	Full
109	Encoding statistical information in the content resources	Full
110	Assigning a unique persistent identifier for all resources	Full

Requirement

Compliance

Licensing information must be included in the metadata

Partial

Partial

Classification metadata should be included, where applicable, in the metadata record of the resource

Partial

Information on the structural annotation (layout) of resources should be included in the metadata of the resource

Partial

Access mode of resources must be included in the metadata

Full

Content resources must include metadata on their format (e.g. XML, DOCX etc.)

Partial

Content resources must include metadata on their language(s)

Partial

Encoding in the metadata a direct access link for content resources

Partial

100

Providing access to content resources (sharing/exposing and transferring)

Full

102

Adding version information in the metadata descriptions of all resources

Partial

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

105

Encoding of format in the metadata description of content resources

Full

106

Encoding licensing terms in the metadata description of the resource

Partial

107

Encoding metadata on domain/subject/ classification for all resources when applicable

Partial

108

Encoding language information in the metadata of content resources

Full

109

Encoding statistical information in the content resources

Full

110

Assigning a unique persistent identifier for all resources

Full

DKPro Core (46)

Compliance	#	%
Full	26	57
No	6	13
Partial	14	30

ID	Requirement	Compliance
1	Components must be described by machine-readable metadata	Full
2	Component metadata have to be embedded into the component source code	Full
3	Component metadata must be separable from the component	Full
5	Components must detail all their environmental requirements for execution	Partial
6	Components should have a unique identifier and a version number	Partial
7	Components must have a fully qualified name that follows the Java class naming conventions	Full
8	Components must associate themselves with categories defined by the OpenMinTeD project	No
9	Components must declare their annotation schema dependencies	Partial
10	Components should specify the types of the annotations that they input and output	Full
11	Components must declare whether they can be scaled within a workflow	Full
12	Components should provide documentation describing their functionality	Full
13	Citation information for component should be included in the metadata	No
16	Models/resources should be useable across different component collections/platforms	Full
17	Components should be stateless	Partial
21	Configuration and parametrizable options of the components should be identified and documented	Full
26	It should be possible to determine the source of an annotation/assigned category	Partial
28	Processing components should be downloadable	Full
33	Licensing information must be included in the metadata	Partial
34	Licensing information should be expressed in a machine-readable form	Partial
36	Classification metadata should be included, where applicable, in the metadata record of the resource	Partial
38	Access mode of resources must be included in the metadata	Full
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	Partial
45	S/W (tools, web services, workflows) must indicate format of their output	Partial
50	Documentation references should be versioned	Full
68	Data Category Linking Vocabulary	Full
74	Machine-readable metadata for UIMA components	Full
75	Embedding UIMA component metadata into the source code	Partial
76	Separating UIMA metadata from the component	Full
78	Specifying input and output types of UIMA components	Full
79	Documentation of UIMA components	Full
88	Embedding output format in UIMA component metadata	Full
89	Version documentation in parallel with component/resource	Full
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
92	Including license text in resource packages	Partial
96	Unique identifiers and versions for components using Maven	Full
97	Declaring scaleout capability in UIMA	Full
98	Publishing components via software repositories (Maven, Docker)	Full
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	Partial
102	Adding version information in the metadata descriptions of all resources	Full
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	Partial
107	Encoding metadata on domain/subject/ classification for all resources when applicable	No
110	Assigning a unique persistent identifier for all resources	Full
111	Annotation schema dependencies for UIMA components using Maven	Full

Requirement

Compliance

Components must be described by machine-readable metadata

Full

Component metadata have to be embedded into the component source code

Full

Component metadata must be separable from the component

Full

Components must detail all their environmental requirements for execution

Partial

Components should have a unique identifier and a version number

Partial

Components must have a fully qualified name that follows the Java class naming conventions

Full

Components must associate themselves with categories defined by the OpenMinTeD project

Components must declare their annotation schema dependencies

Partial

Components should specify the types of the annotations that they input and output

Full

Components must declare whether they can be scaled within a workflow

Full

Components should provide documentation describing their functionality

Full

Citation information for component should be included in the metadata

Models/resources should be useable across different component collections/platforms

Full

Configuration and parametrizable options of the components should be identified and documented

Partial

Full

It should be possible to determine the source of an annotation/assigned category

Partial

Licensing information must be included in the metadata

Full

Partial

Licensing information should be expressed in a machine-readable form

Partial

Classification metadata should be included, where applicable, in the metadata record of the resource

Partial

Access mode of resources must be included in the metadata

Full

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

Partial

S/W (tools, web services, workflows) must indicate format of their output

Partial

Full

Machine-readable metadata for UIMA components

Full

Full

Embedding UIMA component metadata into the source code

Partial

Separating UIMA metadata from the component

Full

Specifying input and output types of UIMA components

Full

Documentation of UIMA components

Full

Embedding output format in UIMA component metadata

Full

Version documentation in parallel with component/resource

Full

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

Encoding citable publications (for scholarly attribution) in resource metadata records

Unique identifiers and versions for components using Maven

Partial

Full

Declaring scaleout capability in UIMA

Full

Publishing components via software repositories (Maven, Docker)

Full

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

Partial

102

Adding version information in the metadata descriptions of all resources

Full

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

Partial

107

Encoding metadata on domain/subject/ classification for all resources when applicable

110

Assigning a unique persistent identifier for all resources

Full

111

Annotation schema dependencies for UIMA components using Maven

Full

Frontiers (18)

Compliance	#	%
Full	9	50
No	5	28
Partial	4	22

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Full
33	Licensing information must be included in the metadata	Full
36	Classification metadata should be included, where applicable, in the metadata record of the resource	Partial
37	Information on the structural annotation (layout) of resources should be included in the metadata of the resource	Full
38	Access mode of resources must be included in the metadata	Full
39	Content resources must include metadata on their format (e.g. XML, DOCX etc.)	Full
41	Content resources must include metadata on their language(s)	Full
99	Encoding in the metadata a direct access link for content resources	Full
100	Providing access to content resources (sharing/exposing and transferring)	Partial
102	Adding version information in the metadata descriptions of all resources	No
103	Specifying access mode of resources and encoding it in the metadata descriptions	Partial
104	Encoding funding information in the metadata descriptions of all resources	No
105	Encoding of format in the metadata description of content resources	No
106	Encoding licensing terms in the metadata description of the resource	No
107	Encoding metadata on domain/subject/ classification for all resources when applicable	Partial
108	Encoding language information in the metadata of content resources	No
109	Encoding statistical information in the content resources	Full
110	Assigning a unique persistent identifier for all resources	Full

Requirement

Compliance

Licensing information must be included in the metadata

Full

Full

Classification metadata should be included, where applicable, in the metadata record of the resource

Partial

Information on the structural annotation (layout) of resources should be included in the metadata of the resource

Full

Access mode of resources must be included in the metadata

Full

Content resources must include metadata on their format (e.g. XML, DOCX etc.)

Full

Content resources must include metadata on their language(s)

Full

Encoding in the metadata a direct access link for content resources

Full

100

Providing access to content resources (sharing/exposing and transferring)

Partial

102

Adding version information in the metadata descriptions of all resources

103

Specifying access mode of resources and encoding it in the metadata descriptions

Partial

104

Encoding funding information in the metadata descriptions of all resources

105

Encoding of format in the metadata description of content resources

106

Encoding licensing terms in the metadata description of the resource

107

Encoding metadata on domain/subject/ classification for all resources when applicable

Partial

108

Encoding language information in the metadata of content resources

109

Encoding statistical information in the content resources

Full

110

Assigning a unique persistent identifier for all resources

Full

GATE (36)

Compliance	#	%
Full	13	36
No	14	39
Partial	9	25

ID	Requirement	Compliance
1	Components must be described by machine-readable metadata	Full
2	Component metadata have to be embedded into the component source code	Partial
3	Component metadata must be separable from the component	Partial
5	Components must detail all their environmental requirements for execution	Partial
6	Components should have a unique identifier and a version number	Partial
7	Components must have a fully qualified name that follows the Java class naming conventions	Full
8	Components must associate themselves with categories defined by the OpenMinTeD project	No
9	Components must declare their annotation schema dependencies	No
10	Components should specify the types of the annotations that they input and output	Partial
11	Components must declare whether they can be scaled within a workflow	Full
12	Components should provide documentation describing their functionality	Full
13	Citation information for component should be included in the metadata	No
16	Models/resources should be useable across different component collections/platforms	Full
17	Components should be stateless	No
21	Configuration and parametrizable options of the components should be identified and documented	Full
26	It should be possible to determine the source of an annotation/assigned category	Partial
28	Processing components should be downloadable	Full
33	Licensing information must be included in the metadata	Partial
36	Classification metadata should be included, where applicable, in the metadata record of the resource	Partial
38	Access mode of resources must be included in the metadata	Full
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	No
45	S/W (tools, web services, workflows) must indicate format of their output	No
50	Documentation references should be versioned	No
89	Version documentation in parallel with component/resource	No
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
92	Including license text in resource packages	No
96	Unique identifiers and versions for components using Maven	Partial
98	Publishing components via software repositories (Maven, Docker)	Full
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	No
102	Adding version information in the metadata descriptions of all resources	Full
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	Full
107	Encoding metadata on domain/subject/ classification for all resources when applicable	No
110	Assigning a unique persistent identifier for all resources	Full

Requirement

Compliance

Components must be described by machine-readable metadata

Full

Component metadata have to be embedded into the component source code

Partial

Component metadata must be separable from the component

Partial

Components must detail all their environmental requirements for execution

Partial

Components should have a unique identifier and a version number

Partial

Components must have a fully qualified name that follows the Java class naming conventions

Full

Components must associate themselves with categories defined by the OpenMinTeD project

Components must declare their annotation schema dependencies

Components should specify the types of the annotations that they input and output

Partial

Components must declare whether they can be scaled within a workflow

Full

Components should provide documentation describing their functionality

Full

Citation information for component should be included in the metadata

Models/resources should be useable across different component collections/platforms

Full

Configuration and parametrizable options of the components should be identified and documented

Full

It should be possible to determine the source of an annotation/assigned category

Partial

Licensing information must be included in the metadata

Full

Partial

Classification metadata should be included, where applicable, in the metadata record of the resource

Partial

Access mode of resources must be included in the metadata

Full

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

S/W (tools, web services, workflows) must indicate format of their output

Version documentation in parallel with component/resource

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

Encoding citable publications (for scholarly attribution) in resource metadata records

Unique identifiers and versions for components using Maven

Partial

Publishing components via software repositories (Maven, Docker)

Full

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

102

Adding version information in the metadata descriptions of all resources

Full

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

Full

107

Encoding metadata on domain/subject/ classification for all resources when applicable

110

Assigning a unique persistent identifier for all resources

Full

ILSP (44)

Compliance	#	%
Full	14	32
No	15	34
Partial	15	34

ID	Requirement	Compliance
1	Components must be described by machine-readable metadata	Full
2	Component metadata have to be embedded into the component source code	Partial
3	Component metadata must be separable from the component	Full
5	Components must detail all their environmental requirements for execution	Full
6	Components should have a unique identifier and a version number	Partial
7	Components must have a fully qualified name that follows the Java class naming conventions	Full
8	Components must associate themselves with categories defined by the OpenMinTeD project	No
9	Components must declare their annotation schema dependencies	Partial
10	Components should specify the types of the annotations that they input and output	Partial
11	Components must declare whether they can be scaled within a workflow	Full
12	Components should provide documentation describing their functionality	Partial
13	Citation information for component should be included in the metadata	Partial
16	Models/resources should be useable across different component collections/platforms	Partial
17	Components should be stateless	Full
21	Configuration and parametrizable options of the components should be identified and documented	Full
26	It should be possible to determine the source of an annotation/assigned category	No
28	Processing components should be downloadable	No
33	Licensing information must be included in the metadata	Partial
36	Classification metadata should be included, where applicable, in the metadata record of the resource	No
38	Access mode of resources must be included in the metadata	Full
43	S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output	Partial
45	S/W (tools, web services, workflows) must indicate format of their output	Partial
50	Documentation references should be versioned	No
74	Machine-readable metadata for UIMA components	Partial
75	Embedding UIMA component metadata into the source code	Partial
76	Separating UIMA metadata from the component	Full
78	Specifying input and output types of UIMA components	Full
79	Documentation of UIMA components	Partial
88	Embedding output format in UIMA component metadata	No
89	Version documentation in parallel with component/resource	No
90	Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
92	Including license text in resource packages	No
96	Unique identifiers and versions for components using Maven	Partial
97	Declaring scaleout capability in UIMA	Full
98	Publishing components via software repositories (Maven, Docker)	No
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	No
102	Adding version information in the metadata descriptions of all resources	No
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	Partial
107	Encoding metadata on domain/subject/ classification for all resources when applicable	No
110	Assigning a unique persistent identifier for all resources	Full
111	Annotation schema dependencies for UIMA components using Maven	Full

Requirement

Compliance

Components must be described by machine-readable metadata

Full

Component metadata have to be embedded into the component source code

Partial

Component metadata must be separable from the component

Full

Components must detail all their environmental requirements for execution

Full

Components should have a unique identifier and a version number

Partial

Components must have a fully qualified name that follows the Java class naming conventions

Full

Components must associate themselves with categories defined by the OpenMinTeD project

Components must declare their annotation schema dependencies

Partial

Components should specify the types of the annotations that they input and output

Partial

Components must declare whether they can be scaled within a workflow

Full

Components should provide documentation describing their functionality

Partial

Citation information for component should be included in the metadata

Partial

Models/resources should be useable across different component collections/platforms

Partial

Configuration and parametrizable options of the components should be identified and documented

Full

Full

It should be possible to determine the source of an annotation/assigned category

Licensing information must be included in the metadata

Partial

Classification metadata should be included, where applicable, in the metadata record of the resource

Access mode of resources must be included in the metadata

Full

S/W (tools, web services, workflows) must indicate whether they are language-independent or the language(s) of the resources they take as input and output

Partial

S/W (tools, web services, workflows) must indicate format of their output

Partial

Machine-readable metadata for UIMA components

Partial

Embedding UIMA component metadata into the source code

Partial

Separating UIMA metadata from the component

Full

Specifying input and output types of UIMA components

Full

Documentation of UIMA components

Partial

Embedding output format in UIMA component metadata

Version documentation in parallel with component/resource

Components must be assigned at least one category from the OMTD-SHARE controlled vocabulary for component types

Encoding citable publications (for scholarly attribution) in resource metadata records

Unique identifiers and versions for components using Maven

Partial

Declaring scaleout capability in UIMA

Full

Publishing components via software repositories (Maven, Docker)

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

102

Adding version information in the metadata descriptions of all resources

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

Partial

107

Encoding metadata on domain/subject/ classification for all resources when applicable

110

Assigning a unique persistent identifier for all resources

Full

111

Annotation schema dependencies for UIMA components using Maven

Full

JATS (13)

Compliance	#	%
Full	2	15
N/A	1	8
No	5	38
Partial	5	38

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Partial
33	Licensing information must be included in the metadata	Partial
38	Access mode of resources must be included in the metadata	No
41	Content resources must include metadata on their language(s)	No
50	Documentation references should be versioned	No
67	Knowledge Resource Element Id	Partial
68	Data Category Linking Vocabulary	No
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	N/A
71	The KR should be ingestible through a URI	Full
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
93	Provide identifiers for knowledge resource elements	Partial
94	Data Category Linking Vocabulary	No
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Partial

Requirement

Compliance

Licensing information must be included in the metadata

Partial

Partial

Access mode of resources must be included in the metadata

Content resources must include metadata on their language(s)

Partial

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

N/A

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

Provide identifiers for knowledge resource elements

Partial

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Partial

LAPPS (25)

Compliance	#	%
Full	12	48
No	12	48
Partial	1	4

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Full
33	Licensing information must be included in the metadata	No
38	Access mode of resources must be included in the metadata	Full
41	Content resources must include metadata on their language(s)	No
44	Statistical metadata that allow monitoring of resource versions may accompany resources	No
50	Documentation references should be versioned	No
67	Knowledge Resource Element Id	Full
68	Data Category Linking Vocabulary	Full
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	Full
71	The KR should be ingestible through a URI	Full
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
89	Version documentation in parallel with component/resource	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
93	Provide identifiers for knowledge resource elements	Full
94	Data Category Linking Vocabulary	Full
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	No
102	Adding version information in the metadata descriptions of all resources	No
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	No
107	Encoding metadata on domain/subject/ classification for all resources when applicable	No
108	Encoding language information in the metadata of content resources	Partial
109	Encoding statistical information in the content resources	No
110	Assigning a unique persistent identifier for all resources	Full

Requirement

Compliance

Licensing information must be included in the metadata

Full

Access mode of resources must be included in the metadata

Full

Content resources must include metadata on their language(s)

Statistical metadata that allow monitoring of resource versions may accompany resources

Full

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

Full

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

Version documentation in parallel with component/resource

Encoding citable publications (for scholarly attribution) in resource metadata records

Provide identifiers for knowledge resource elements

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

102

Adding version information in the metadata descriptions of all resources

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

107

Encoding metadata on domain/subject/ classification for all resources when applicable

108

Encoding language information in the metadata of content resources

Partial

109

Encoding statistical information in the content resources

110

Assigning a unique persistent identifier for all resources

Full

Licences (4)

Compliance	#	%
Full	1	25
Partial	2	50
Unknown	1	25

ID	Requirement	Compliance
33	Licensing information must be included in the metadata	Full
34	Licensing information should be expressed in a machine-readable form	Partial
50	Documentation references should be versioned	Partial
89	Version documentation in parallel with component/resource	Unknown

Requirement

Compliance

Licensing information must be included in the metadata

Full

Licensing information should be expressed in a machine-readable form

Partial

Version documentation in parallel with component/resource

Partial

Unknown

OLiA (25)

Compliance	#	%
Full	12	48
No	12	48
Partial	1	4

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Full
33	Licensing information must be included in the metadata	No
38	Access mode of resources must be included in the metadata	Full
41	Content resources must include metadata on their language(s)	No
44	Statistical metadata that allow monitoring of resource versions may accompany resources	No
50	Documentation references should be versioned	No
67	Knowledge Resource Element Id	Full
68	Data Category Linking Vocabulary	Full
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	Full
71	The KR should be ingestible through a URI	Full
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
89	Version documentation in parallel with component/resource	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
93	Provide identifiers for knowledge resource elements	Full
94	Data Category Linking Vocabulary	Full
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	No
102	Adding version information in the metadata descriptions of all resources	Partial
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	No
107	Encoding metadata on domain/subject/ classification for all resources when applicable	No
108	Encoding language information in the metadata of content resources	No
109	Encoding statistical information in the content resources	No
110	Assigning a unique persistent identifier for all resources	Full

Requirement

Compliance

Licensing information must be included in the metadata

Full

Access mode of resources must be included in the metadata

Full

Content resources must include metadata on their language(s)

Statistical metadata that allow monitoring of resource versions may accompany resources

Full

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

Full

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

Version documentation in parallel with component/resource

Encoding citable publications (for scholarly attribution) in resource metadata records

Provide identifiers for knowledge resource elements

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

102

Adding version information in the metadata descriptions of all resources

Partial

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

107

Encoding metadata on domain/subject/ classification for all resources when applicable

108

Encoding language information in the metadata of content resources

109

Encoding statistical information in the content resources

110

Assigning a unique persistent identifier for all resources

Full

Ontolex (8)

Compliance	#	%
Full	8	100

Compliance

Full

100

ID	Requirement	Compliance
67	Knowledge Resource Element Id	Full
68	Data Category Linking Vocabulary	Full
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	Full
71	The KR should be ingestible through a URI	Full
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
93	Provide identifiers for knowledge resource elements	Full
94	Data Category Linking Vocabulary	Full
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full

Requirement

Compliance

Full

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

Full

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

Provide identifiers for knowledge resource elements

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

OpenAIRE (18)

Compliance	#	%
Full	2	11
No	4	22
Partial	12	67

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Partial
33	Licensing information must be included in the metadata	Partial
36	Classification metadata should be included, where applicable, in the metadata record of the resource	Partial
37	Information on the structural annotation (layout) of resources should be included in the metadata of the resource	Partial
38	Access mode of resources must be included in the metadata	Partial
39	Content resources must include metadata on their format (e.g. XML, DOCX etc.)	No
41	Content resources must include metadata on their language(s)	Partial
99	Encoding in the metadata a direct access link for content resources	Partial
100	Providing access to content resources (sharing/exposing and transferring)	No
102	Adding version information in the metadata descriptions of all resources	No
103	Specifying access mode of resources and encoding it in the metadata descriptions	Partial
104	Encoding funding information in the metadata descriptions of all resources	Full
105	Encoding of format in the metadata description of content resources	Partial
106	Encoding licensing terms in the metadata description of the resource	No
107	Encoding metadata on domain/subject/ classification for all resources when applicable	Partial
108	Encoding language information in the metadata of content resources	Full
109	Encoding statistical information in the content resources	Partial
110	Assigning a unique persistent identifier for all resources	Partial

Requirement

Compliance

Licensing information must be included in the metadata

Partial

Partial

Classification metadata should be included, where applicable, in the metadata record of the resource

Partial

Information on the structural annotation (layout) of resources should be included in the metadata of the resource

Partial

Access mode of resources must be included in the metadata

Partial

Content resources must include metadata on their format (e.g. XML, DOCX etc.)

Content resources must include metadata on their language(s)

Partial

Encoding in the metadata a direct access link for content resources

Partial

100

Providing access to content resources (sharing/exposing and transferring)

102

Adding version information in the metadata descriptions of all resources

103

Specifying access mode of resources and encoding it in the metadata descriptions

Partial

104

Encoding funding information in the metadata descriptions of all resources

Full

105

Encoding of format in the metadata description of content resources

Partial

106

Encoding licensing terms in the metadata description of the resource

107

Encoding metadata on domain/subject/ classification for all resources when applicable

Partial

108

Encoding language information in the metadata of content resources

Full

109

Encoding statistical information in the content resources

Partial

110

Assigning a unique persistent identifier for all resources

Partial

TheSoz (26)

Compliance	#	%
Full	20	77
No	4	15
Partial	2	8

ID	Requirement	Compliance
4	URL to actual content must be discoverable	Full
33	Licensing information must be included in the metadata	Full
36	Classification metadata should be included, where applicable, in the metadata record of the resource	Full
38	Access mode of resources must be included in the metadata	Full
41	Content resources must include metadata on their language(s)	Full
44	Statistical metadata that allow monitoring of resource versions may accompany resources	Partial
50	Documentation references should be versioned	No
67	Knowledge Resource Element Id	Full
68	Data Category Linking Vocabulary	Full
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	Full
71	The KR should be ingestible through a URI	Full
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
89	Version documentation in parallel with component/resource	No
91	Encoding citable publications (for scholarly attribution) in resource metadata records	No
93	Provide identifiers for knowledge resource elements	Full
94	Data Category Linking Vocabulary	Full
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
101	Making models and annotation resources accessible as entities distinct from the components they are compatible with	Partial
102	Adding version information in the metadata descriptions of all resources	Full
103	Specifying access mode of resources and encoding it in the metadata descriptions	Full
104	Encoding funding information in the metadata descriptions of all resources	No
106	Encoding licensing terms in the metadata description of the resource	Full
107	Encoding metadata on domain/subject/ classification for all resources when applicable	Full
108	Encoding language information in the metadata of content resources	Full
109	Encoding statistical information in the content resources	Full
110	Assigning a unique persistent identifier for all resources	Full

Requirement

Compliance

Licensing information must be included in the metadata

Full

Full

Classification metadata should be included, where applicable, in the metadata record of the resource

Full

Access mode of resources must be included in the metadata

Full

Content resources must include metadata on their language(s)

Full

Statistical metadata that allow monitoring of resource versions may accompany resources

Partial

Full

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

Full

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

Version documentation in parallel with component/resource

Encoding citable publications (for scholarly attribution) in resource metadata records

Provide identifiers for knowledge resource elements

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

101

Making models and annotation resources accessible as entities distinct from the components they are compatible with

Partial

102

Adding version information in the metadata descriptions of all resources

Full

103

Specifying access mode of resources and encoding it in the metadata descriptions

Full

104

Encoding funding information in the metadata descriptions of all resources

106

Encoding licensing terms in the metadata description of the resource

Full

107

Encoding metadata on domain/subject/ classification for all resources when applicable

Full

108

Encoding language information in the metadata of content resources

Full

109

Encoding statistical information in the content resources

Full

110

Assigning a unique persistent identifier for all resources

Full

schema.org (8)

Compliance	#	%
Full	7	88
No	1	13

ID	Requirement	Compliance
67	Knowledge Resource Element Id	Full
68	Data Category Linking Vocabulary	Full
69	Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.	No
71	The KR should be ingestible through a URI	Full
72	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full
93	Provide identifiers for knowledge resource elements	Full
94	Data Category Linking Vocabulary	Full
95	The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.	Full

Requirement

Compliance

Full

Interoperability between elements from different knowledge resource schemas should be expressed through RDF statements.

Full

The KR format should be in a standard format such as XML, JSON-LD or RDF/XML.

Full

Full

Provide identifiers for knowledge resource elements

Full