L8 Task Group Meeting

December 8 - 9, 2004

Minutes

 

Attendees:

Dan Gillman, BLS

Judith Newton, Ashton Computing & Management Services, LLC

Gail Wright, Oracle Corp.

Frank Farance, Farance Inc.

Jim Mercurio, VA

Paul Levine, Global e-Business Advisory Council, WG1

Kevin Keck, LBNL

Bruce Bargmeyer, LBNL

 

Topics:

 

WG2 Meeting in Washington Report

Frank Farance reported: 

  • Thanked Dan and BLS for hosting this meeting.
  • 19763 Metamodel Framework – some progress, but don’t know what problem its solving.  Difficult topic.
  • Discussed 20944.  Some parts in ready state, some not.  Moved ones not ready back to state 0.  Other parts in CD status.
  • 19773  Some parts expired.  Doug revived them administratively.  13 parts now in total.
  • 20943 Part 2 (XML part) withdrawn temporarily due to no progress.
  • Common Logic – good, informative presentation
  • Not discussed: 24706 and 14957
  • Participation of US committee was good.  Some free form discussion among US participants about technical issues – area for improvement.

Dan Gillman reported:

  • Thought event went well
  • Main surprise – discussion of XMDR – activities are happening outside of the US tag; however, got wrong impression that this had sole US involvement.
  • Ray posted WG2 meeting documents.  Find on metadata-standards.org website. Click on Library link.  Scroll down to Meeting Documents.  Should be the first in list: Washington Meeting.  Also,  the WG2 Resolutions and Meeting Report should be on the SC32 website; however, it is not currently posted.
  • The death date for 11179-3 is coming up (2 years with no change or reissue).

 

WG1 Meeting in Tallinn Estonia Report

Paul Levine reported:

  • Paul presented the WG1 Report as he provided to the L8 reflector mailing list.
  • WG1 meeting was hosted by the Finland national body.  Held on October 25-29, 2004.
  • Agenda:  Process contributions on ISO/IEC 15944 Parts 2-5 in preparation for FCD ballot on Part 2 and 2nd CD ballots on Parts 3-5
  • Came up with a revision document for Parts 2-5 (WG1 N269-N272) which consolidated resolution of ballot comments and editing instructions. 
  • Discussed issues with having continued Microsoft support for this activity.
  • Discussion on patent rights.  Frank – can’t make changes to patent language in committee.  Paul – lots of discussion on intellectual property rights.  Why have patent language in ISO documents?  The issue is that the statement does not state what the patent rights are (if any) and can be misleading as the standard is created independently of  the creation of patents.  However, parties might make claims for patent infringement.  In creating the standard, we could infringe on patent rights without knowing it.  Our non-legal interpretation is that ISO includes this language to protect itself against any pre-existing patents.  Dan – the purpose of this discussion is to figure out what the US tag’s position is. Frank – the secretariat made this patent disclaimer in conjunction with attorneys and believe it has some importance; therefore.  Committees have no say in this matter.  From the US point of view, we can say that the patent right policy discussion is “out of order” to discuss in WG1.  We cannot clarify or state what the policy is.  All agreed.
  • Frank would like to create a framework for metadata service standards on the web (vs. web services) that includes usability, accessibility, privacy policies, etc.  Frank can approach a company/firm that does this type of work to get input.  Dan believes that there is an enormous amount of work already done in this area.

 

XMDR

Bruce Bargmeyer and Kevin Keck reported:

  • Website: www.xmdr.org
  • Looking at persistent stores (different management systems: Relational, Object, XML, RDF, etc.) and analyze the query support that each technology provides.  For ontologies, query technologies may be insufficient in terms of performance and levels of granularity.
  • Project Scope:  11179 Part 2.  Start with existing classification schemes (SNOMED, LOINC, NAICS, etc.) and use them to classify Administered Items.  Determine the best models/technologies for such classification schemes that support the types of queries required – including ontology/semantic web type structures and queries.  Will likely not choose a relational or object management system.  Part 3 could still be implemented in a relational system as part of a hybrid solution with additional integration work.  Looking at Tucana and Protégé and ontology editors.  Plan to post recommendations on the website in January 2005.  If RDL-based, then must support RDQL or SPARQL.  Have a bias towards open source solutions.
  • Content of current metadata registries is useful to semantic web in that they provide “agreed upon” terminology.  There is more work in relating classification schemes to data elements.  In one case, a classification scheme item can categorize the data element.  In other case, the classification scheme may be the value domain of the data element. “Concept systems” is synonymous with the “Classification scheme”.  Could extend usage of 11179 registries to be the semantic servers on semantic webs.
  • Referenced Sam Chance’s presentation.  Kevin - want metadata registry to be more “semantics aware” using software accessible descriptions that describes relationships (for example: between Administered Items).  For example: A particular concept “is comparable” to another concept because they have a common ancestor. Frank – This is a problem with software systems as they cannot interpret an “is a” relationship.  Such relationships are better interpreted by humans.  Kevin - Ontology: A model that software uses to describe a data set semantically vs. structurally. 
  • Frank – Going back to the Terminology Server idea – there is a relationship between concepts and signs, manage relationships among themselves, etc.  If using object oriented technology “class” and 11179 is a sub-class with specialization which is apparent in the attributes we use. 11179 specializes in the “data industry” which is different than specializing in “terminology/linguistics”.  There can be overlap, but there are also distinctions.  Bruce agrees.  However, the metadata registry may be reused for secondary purposes with different intent than originally envisioned.
  • OMG working on ODM (Ontology Definition Metamodel). 

 

Dimensionality

 

  • Dan - In WG2, agreed upon a definition, but don’t have consensus on how this fits in the model. Could not find WG2 consensus definition; however, did find: Dimensionality - semantically similar units of measure.  Dan believes dimensionality should not be part of the Conceptual Domain.  Have attached it there because Dimensionality is the conceptual counterpart to Units of Measure, which is representational.
  • Dan - Units of measure, dimensionality, data type, etc. seem to be related, yet are independent of each other.  Are units of measure only associated with non-enumerated domains?  Think of unitless data like a ratio – but do need to know the units of both the numerator and the denominator. 
  • Frank – Enumerated vs. Non-enumerated – don’t believe this is a factor or makes a difference for units of measure.  Ratio – algebraic model - applies in some circumstances, but not in others.  Original question – why do we care?  Would like to have unit of measure at the Conceptual Domain level (e.g. Temperature in Degrees Fahrenheit).  Frank Olken joined and disagreed, saying that “Degrees Farenheit” is definitely representational.
  • Lots of discussion on the subject.  Looked at 20943-1 example.  No consensus.  To be continued.

                                                                                                                 

20944

Frank Farance reported

  • ISO/IEC 20944 set of standards addresses the need to access the contents of an ISO/IEC 11179 metadata registry by computer.  20944 provides the provisions necessary to describe how to achieve this.  It does not provide implementation details.
  • Each new technology, e.g., programming languages, needs a binding.  A binding is a map from a standard or framework to another.
  • The 20944 standard is broken into pieces along the lines of conformity.  General aspects are separated from (language) specific requirements.  This is based on good design principles.
  • Bindings are provided in 3 types: codings, API's, and protocols.  Respectively, they describe data transfer rules, control transfer rules, or both.  Data transfer handles situations such as mapping the contents of a registry to an XML schema.  There are 2 kinds of control transfer: 1 way and 2 way.  An example of 1 way control transfer is the use of "go to" statements in a programming language.  Typically, the "go to" tells a program where to jump to, but there is no way to return.  An example of 2 way control transfer is a function or subroutine call.  The program jumps to another section of the code, and returns to the point of the jump when the function or subroutine is finished.
  • Frank recommends going to the example in clause 7.1 of 20944-01.  Here, read the other parts as required to understand the sequence of steps (e.g., read parts 01, 41, 40, 03, and 81).
  • Kevin Keck was worried about the scalability of the proposed architecture in 20944.  He proposed a stateless, rather then state-based, authentication process.  Frank said it is too complex to encapsulate all authentication needs of every user at the enterprise level.  Instead, 20944 provides a hook to incorporate any authentication required by an application.  In a sense, agreement upon a specific authentication technique is outside the scope of 20944.  Furthermore, Frank stated that it is possible to implement a stateless system with the current specification.
  • Frank stated that the design criterion he aims to achieve is the support of implementation varieties that are interoperable, yet can choose their own implementation strategy (variety control defined in ISO/IEC Guide 2).

 

19773

Frank Farance reported

  • Read documents by working backward.  Start in Part 8, for instance.  The best arrangement for the parts is open for debate.
  • The important technical detail is the complex datatypes defined: reflit, multi-string, multi-text, multi-value, and multi-data.  All these are based on the ISO/IEC 11404 "choice" datatype generator.  Parts 3 and 4 give the details for these datatypes.

 

11179-3 3rd ed

Bruce Bargmeyer and Kevin Keck reported on Issue 136

  • Bruce and Kevin discussed their proposal to modify the "recursive" relationships on the following classes in the model: Classification Scheme Item; Value Domain; Conceptual Domain; and Data Element Concept.
  • The problems are that the current presentation in the UML diagram does not specify directionality and it separates the "recursive" relationships from the concepts.  The "recursive" relationships on a concept are part of what it is and need to be recorded that way.
  • The proposal modifies the UML diagram to relate the "recursive" relationships in a "part of" relation on the class.  See the Issue 136 contained in the SD14 standing document, obtained from the WG2 web site (http://metadata-standards.org) document library in the Washington meeting listing.
  • The participants at the meeting were in favor of the proposal.  Dan suggested there may be problems with extending the structure to value domains, but he was in favor of the structure on the other classes.
  • This led to further discussion of value domains and the problem of domains of mixed type.  That is, there are examples that have both enumerated and non-enumerated components.  The proposed solution is to create 3 value domains: one for the enumerated values; one for the non-enumerated values; and one as a parent value domain that contains the other 2.  The participants all felt this solution worked.
  • This led to further discussion of dimensionality, which is Issue 124 in the SD14.  The participants agreed that the definition as it now reads is correct.  The algebraic formulation of dimensionality is incomplete.  Characterizing operations are necessary to completely define the equivalence classes.  Kevin and Frank Olken were worried about this, but Kevin seemed satisfied after the discussion.  Frank was part of an equivalent discussion in the Washington WG2 meeting and was similarly satisfied.