nexml schema 1.0 ~/nexml/xsd/characters/protein.xsd

Skip to: Site menu | Main content

The future data exchange standard is here!

nexml is an exchange standard for representing phylogenetic data — inspired by the commonly used NEXUS format, but more robust and easier to process.

Process nexml data

Schema module documentation

~ / nexml / html / doc / schema-1 / characters / protein
rss | digg reddit del.icio.us facebook | Tue Dec 16 16:26:28 2008 —

Description

This module defines concrete subclasses for the abstract character matrix components that all character matrices must implement. The subclasses defined here apply to amino acid character data. In a verbose notation, this data is represented as the "state" attribute of the <cell> element, whose value-space is limited to the IUPAC single character nucleotide symbols. In a compact notation, the same data is represented as a sequence of tokens (whitespace is allowed but has no meaning).

Other views (open in new window):

Complex types

AAChar

Description

A concrete implementation of the AbstractChar element. In this implementation there is no reference to explicitly defined states, as we use the IUPAC standard. There must be an identifier ("id") that is of type xs:nonNegativeInteger to define the column number.

Inheritance

AAChar restricts AbstractChar.

Attributes
Name Type Usage
tokens MSTokenLength prohibited
states xs:NCName prohibited
codon CodonPosition prohibited
id xs:NCName required
Substructures
Definition source
  <xs:complexType name="AAChar" sawsdl:modelReference="http://evolutionaryontology-dev.nescent.org/cdao.owl#AminoAcidResidueCharacter">
    <xs:complexContent>
      <xs:restriction base="AbstractChar">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="0" name="dict" type="Dict"/>
        </xs:sequence>
        <xs:attribute name="tokens" type="MSTokenLength" use="prohibited"/>
        <xs:attribute name="states" type="xs:NCName" use="prohibited"/>
        <xs:attribute name="codon" type="CodonPosition" use="prohibited"/>
        <xs:attribute name="id" type="xs:NCName" use="required"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

AAFormat

Description

The AAFormat class is the container of amino acid column definitions.

Inheritance

AAFormat restricts AbstractFormat.

Substructures
Definition source
  <xs:complexType abstract="false" name="AAFormat">
    <xs:complexContent>
      <xs:restriction base="AbstractFormat">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="1" name="char" type="AAChar"/>
        </xs:sequence>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

AAMatrixObsRow

Description

This is a row in a matrix of amino acid data containing granular observations.

Inheritance

AAMatrixObsRow restricts AbstractObsRow.

Substructures
Definition source
  <xs:complexType abstract="false" name="AAMatrixObsRow">
    <xs:complexContent>
      <xs:restriction base="AbstractObsRow">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="0" name="dict" type="Dict"/>
          <xs:element maxOccurs="unbounded" minOccurs="1" name="cell" type="AAObs"/>
        </xs:sequence>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

AAMatrixSeqRow

Description

This is a row in a matrix of amino acid data containing raw sequence data.

Inheritance

AAMatrixSeqRow restricts AbstractSeqRow.

Substructures
Definition source
  <xs:complexType abstract="false" name="AAMatrixSeqRow">
    <xs:complexContent>
      <xs:restriction base="AbstractSeqRow">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="0" name="dict" type="Dict"/>
          <xs:element maxOccurs="1" minOccurs="1" name="seq" type="AASeq"/>
        </xs:sequence>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

AAObs

Description

This is a single cell in a matrix containing an amino acid observation.

Inheritance

AAObs restricts AbstractObs.

Attributes
Name Type Usage
char xs:nonNegativeInteger required
state AAToken required
Substructures
Definition source
  <xs:complexType abstract="false" name="AAObs" sawsdl:modelReference="http://evolutionaryontology-dev.nescent.org/cdao.owl#AminoAcidResidueStateDatum">
    <xs:complexContent>
      <xs:restriction base="AbstractObs">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="0" name="dict" type="Dict"/>
        </xs:sequence>
        <xs:attribute name="char" type="xs:nonNegativeInteger" use="required"/>
        <xs:attribute name="state" type="AAToken" use="required"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

AAObsMatrix

Description

A matrix of rows with single character observations.

Inheritance

AAObsMatrix restricts AbstractObsMatrix.

Substructures
Definition source
  <xs:complexType abstract="false" name="AAObsMatrix">
    <xs:complexContent>
      <xs:restriction base="AbstractObsMatrix">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="1" name="row" type="AAMatrixObsRow"/>
        </xs:sequence>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

AASeqMatrix

Description

A matrix of rows with amino acid data as sequence strings.

Inheritance

AASeqMatrix restricts AbstractSeqMatrix.

Substructures
Definition source
  <xs:complexType abstract="false" name="AASeqMatrix">
    <xs:complexContent>
      <xs:restriction base="AbstractSeqMatrix">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="1" name="row" type="AAMatrixSeqRow"/>
        </xs:sequence>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

ProteinCells

Description

An amino acid characters block consisting of granular cells preceded by metadata.

Inheritance

ProteinCells restricts AbstractCells.

Substructures
Definition source
  <xs:complexType abstract="false" name="ProteinCells">
    <xs:complexContent>
      <xs:restriction base="AbstractCells">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="0" name="dict" type="Dict"/>
          <xs:element maxOccurs="1" minOccurs="0" name="format" type="AAFormat"/>
          <xs:element maxOccurs="1" minOccurs="1" name="matrix" type="AAObsMatrix"/>
        </xs:sequence>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

ProteinSeqs

Description

An amino acid characters block consisting of sequences preceded by metadata.

Inheritance

ProteinSeqs restricts AbstractSeqs.

Substructures
Definition source
  <xs:complexType abstract="false" name="ProteinSeqs">
    <xs:complexContent>
      <xs:restriction base="AbstractSeqs">
        <xs:sequence maxOccurs="1" minOccurs="1">
          <xs:element maxOccurs="unbounded" minOccurs="0" name="dict" type="Dict"/>
          <xs:element maxOccurs="1" minOccurs="0" name="format" type="AAFormat"/>
          <xs:element maxOccurs="1" minOccurs="1" name="matrix" type="AASeqMatrix"/>
        </xs:sequence>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

Simple types

AASeq

Description

This is a simple type that specifies a sequence of amino acid characters, following IUPAC single character symbols for aa (and ambiguities).

Inheritance

AASeq restricts AbstractSeq.

Facets
Type Value
pattern [ABCDEFGHIKLMNPQRSTUVWXYZ\s]*
Definition source
  <xs:simpleType name="AASeq">
    <xs:restriction base="AbstractSeq">
      <xs:pattern value="[ABCDEFGHIKLMNPQRSTUVWXYZ\s]*"/>
    </xs:restriction>
  </xs:simpleType>

AAToken

Description

The AAToken simple type is a restriction of AbstractSymbol that defines an IUPAC single character amino acid symbol.

Inheritance

AAToken restricts AbstractSymbol.

Facets
Type Value
pattern [ABCDEFGHIKLMNPQRSTUVWXYZ]
length 1
Definition source
  <xs:simpleType name="AAToken">
    <xs:restriction base="AbstractSymbol">
      <xs:pattern value="[ABCDEFGHIKLMNPQRSTUVWXYZ]"/>
      <xs:length value="1"/>
    </xs:restriction>
  </xs:simpleType>

See also

~/nexml/xsd/characters/protein.xsd includes the following other files:

~/nexml/xsd/characters/protein.xsd is included in:

This page summarizes the contents of schema file "~/nexml/xsd/characters/protein.xsd" as of revision 654. That schema file was last modified by rvos on 2008-09-22 15:17:49 -0400 (Mon, 22 Sep 2008), and this HTML summary page was generated on Tue Dec 16 16:26:28 2008.