head 1.17;
access;
symbols;
locks; strict;
comment @# @;
expand @o@;
1.17
date 2010.01.18.14.44.32; author GregorHaggedorn; state Exp;
branches;
next 1.16;
1.16
date 2009.11.25.03.14.35; author GarryJolleyRogers; state Exp;
branches;
next 1.15;
1.15
date 2009.11.20.02.45.28; author LeeBelbin; state Exp;
branches;
next 1.14;
1.14
date 2008.11.20.17.42.47; author GregorHagedorn; state Exp;
branches;
next 1.13;
1.13
date 2008.06.07.20.45.30; author GregorHagedorn; state Exp;
branches;
next 1.12;
1.12
date 2008.06.04.16.53.17; author GregorHagedorn; state Exp;
branches;
next 1.11;
1.11
date 2008.05.14.11.14.38; author GregorHagedorn; state Exp;
branches;
next 1.10;
1.10
date 2008.04.30.23.24.10; author GregorHagedorn; state Exp;
branches;
next 1.9;
1.9
date 2008.04.29.13.27.07; author GregorHagedorn; state Exp;
branches;
next 1.8;
1.8
date 2008.04.28.20.01.50; author GregorHagedorn; state Exp;
branches;
next 1.7;
1.7
date 2008.03.28.16.28.34; author GregorHagedorn; state Exp;
branches;
next 1.6;
1.6
date 2007.10.16.17.25.57; author GregorHagedorn; state Exp;
branches;
next 1.5;
1.5
date 2007.10.16.12.02.01; author GregorHagedorn; state Exp;
branches;
next 1.4;
1.4
date 2007.10.13.13.13.10; author GregorHagedorn; state Exp;
branches;
next 1.3;
1.3
date 2007.10.12.15.50.25; author GregorHagedorn; state Exp;
branches;
next 1.2;
1.2
date 2007.10.12.14.21.53; author GregorHagedorn; state Exp;
branches;
next 1.1;
1.1
date 2007.10.12.11.34.25; author GregorHagedorn; state Exp;
branches;
next ;
desc
@none
@
1.17
log
@none
@
text
@%META:TOPICINFO{author="GregorHaggedorn" date="1263825872" format="1.1" version="1.17"}%
%META:TOPICPARENT{name="WebHome"}%
Real world examples (SDD 1.1)
To help application developers in understanding SDD and testing their software, we provide a number of "real world" example data sets for SDD, version 1.1. We try to provide different data sets with different properties and originating from different applications.
1. Data sets for multi-access keys ("matrix keys")
Transformed DELTA example data set
Introduction: The following two datasets are either distributed together with the CSIRO DELTA programs, or used in feature comparisons. They are provided here to help people with a DELTA background to understand the relation between SDD and DELTA.
Description of data sets: The first data set is a minimal set with 4 characters and 3 descriptions of beetles. It is used as the example on Data Requirements for Natural-language Descriptions and Identification and provided there in various formats (DELTA, NEXUS, Lucid Interchange Format File v. 1.1 (old version of Lucid), and XDELTA). The second set contains a larger character set and 14 grass species. It is distributed together with the DELTA programs (version from 2000, see "All programs (including Intkey)" on DELTA Programs and Documentation). The original DELTA file is ANSI (not ASCII) encoded and uses RTF for character markup. This example is provided both as a single-document and as a multifile xml-document set. The multifile approach uses multiple xml fragments that can be individually edited or placed in different repositories and which finally can be combined using xml entities into a "master-document". To some extent this mirror the most common use of DELTA using a folder plus multiple directive files.
Data conversion: Both datasets were initially converted by importing the DELTA data into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions 2.0 beta 10]] and exported to SDD 1.1 from there. Since the SDD produced in this way contains various DiversityDescriptions-specific information, the datasets were slightly cleaned by hand afterwards.
Copyright and license: Both datasets are used by specific permission by Mike Dallwitz 2008. They are not placed under a general license.
SDD documents:
FOLLOWING ARE 3 BROKEN LINKS - NEED TO BE FIXED:
* [[%ATTACHURL%/Beetles.sdd11.zip][Beetles.sdd11.zip]]: Beetles dataset in SDD 1.1 format
* [[%ATTACHURL%/DELTA2000-Sample.sdd11.zip][DELTA2000-Sample.sdd11.zip]]: DELTA.exe (2000) sample dataset on 14 Grass genera in SDD 1.1 format
* [[%ATTACHURL%/DELTA2000_Multifile.sdd11.zip][DELTA2000_Multifile.sdd11.zip]]: As above, but using multiple xml fragments combined into a master document
----
LIAS data set
Introduction: [[http://www.lias.net/][LIAS]] is a global information system for Lichenized and Non-Lichenized Ascomycetes. The vision of LIAS is to establish a non-commercial global information system for the collection and distribution of descriptive, phylogenetic, and other biodiversity data on these taxonomic groups that uses advanced technology and where published biodiversity data of all ascomycetes are joined in a multi-authored database and used for the most sophisticated queries. Specific goals are to
* provide a working space for cooperation and collaboration of experts on ascomycetes in the Internet
* establish a multi-authored worldwide database on descriptive data of all ascomycetes
* design user-friendly web tools for an easier access and remote editing of database records via Internet
* offer a online database system for multiple usage and therewith dissemination of expert knowledge especially by providing public access to database generated identification keys and natural language description of ascomycetes
* promote the gathering, furnishing and administration of data by experts in a standard database system which allows an information deposit for individual use only (e. g. for revision) and – after agreement – the public access to the data via Internet
* promote common standards on descriptive data connected with taxonomic names of ascomycetes to facilitate interoperability and data exchange
LIAS is the work of many collaborators. The primary editors are G. Rambold and D. Triebel. Cooperating Institutions are the University of Bayreuth, Department of Mycology, Botanische Staatssammlung München, Arizona State University, Department of Plant Biology, University of Hamburg, Herbarium Hamburgense, and the University of Oslo, Botanical Museum. It is or was supported by funds from the Bayerisches Staatsministerium für Wissenschaft, Forschung und Kunst, Bundesministerium für Bildung und Wissenschaft (BMBF), Deutsche Forschungsgemeinschaft (DFG), and Staatliche Naturwissenschaftliche Sammlungen Bayerns.
Description of data set: The data set provided here is the complete [[http://www.lias.net/][LIAS]] main data set as of 2007-07. It provides descriptions of 2480 genera and species of lichens using 987 characters with a total of 7632 categorical state definitions (plus 3128 status values or statistical measures for quantitative characters). The descriptions are atomized to a total of 221821 values. Only relatively few characters and states are "pseudo-" or "management-characters", dealing with taxonomy, revision management, etc. Of the total LIAS main character data matrix of 2480x987 = 2447760 cells, 157041 cells are filled (6.4%). Part of this low fill factor is due to the taxonomic diversity encompassed in the data set, but it also shows that significant work still has to be done.
Related data sets: Two datasets are closely related with "LIAS main": (1) [[http://liaslight.lias.net/][LIAS light]] contains fewer characters but has been more extensively revised and has a higher fill factor. It is therefore more suitable for practical identification and currently strongly expanded as part of two major joint projects: the [[http://www.biota-africa.org][BIOTA Africa]] project and the [[http://nhc.asu.edu/lichens/flora/flora.jsp][Greater Sonoran Desert Lichen Flora]]. (2) A key to around 700 powdery mildews (Erysiphales), which for reasons convenience is coupled with LIAS main, has been excluded from this release.
Data conversion: The "LIAS main" dataset is managed in [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]]; the attached SDD 1.1 export file was created by the DiversityDescriptions export routine.
Highlights for testing SDD: The LIAS dataset is a large dataset and is especially suitable for testing the behavior of an application with large and rich keys.
Copyright and license: The "LIAS main" dataset attached here is © 1996-2007 by Botanische Staatssammlung München. All rights reserved. It is here released under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5. Further details are included in the file itself.
SDD document: [[%ATTACHURL%/LIAS_Main.sdd11.zip][LIAS_Main.sdd11.zip]]: LIAS Main dataset in SDD 1.1 format
----
Interactive Key to Species of Erythroneura
Introduction: The [[http://ctap.inhs.uiuc.edu/dmitriev/key.asp?key=Erythroneura&lng=En&i=1&keyN=2][Interactive Key to Species of the Genus Erythroneura (Homoptera, Cicadellidae)]] by D. Dmitriev & C. Dietrich is also available online under the [[http://ctap.inhs.uiuc.edu/dmitriev/][3I software]] created by Dmitry A. Dmitriev. 3I (Internet-accessible Interactive Identification) is a set of software tools for creating on-line identification keys, taxonomic databases, and virtual taxonomic revisions. By organizing illustrations and nomenclatural, morphological, bibliographical, and distributional data into a single database 3I also facilitates production of traditional, printed taxonomic papers and monographs. As such it is more comprehensive that SDD alone, pointing into the direction into which SDD plans to evolve (online monographs including nomenclature as well as descriptions and identification tools).
Description of data set: The data set is a small sized key for 54 taxa, using 42 characters, 171 categorical state definitions, and 2401 values. It contains only a single quantitative character.
Data conversion: The export to SDD occurred indirectly, importing the original 3I database into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]] (converter available since version 2.0) and creating SDD from there. As a result, some details (specimen, nomenclature, publication data), which could in principle be expressed in SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
Highlights for testing SDD: The dataset is a small fully revised and published dataset with rich illustrations. Although the images are not included here, as of 2007-10-12 the given URLs were resolvable. Note: the dataset does not use any Status values ("unknown", "not applicable", etc.).
Copyright and license: The Erythroneura dataset attached here is © 2003-2006 D. Dmitriev & C. Dietrich. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5.
SDD document: [[%ATTACHURL%/Erythroneura.sdd11.zip][Erythroneura.sdd11.zip]]: D.Dmitriev's Erythroneura key in SDD 1.1 format
----
An Interactive Key to Tribes of Leafhoppers / Интерактивная Определительная Таблица Цикадок (Cicadellidae, in English and Russian)
Introduction: This key by D. Dmitriev & C. Dietrich is used to demonstrate the multilingual properties of the [[http://ctap.inhs.uiuc.edu/dmitriev/][3I software]] and is available in [[http://ctap.inhs.uiuc.edu/dmitriev/key.asp?key=Cicnymph&i=1&lng=En][English]] and [[http://ctap.inhs.uiuc.edu/dmitriev/key.asp?key=Cicnymph&i=1&lng=Ru][Russian]]. See the "Interactive Key to Species of Erythroneura" above for further information on 3I.
Description of data set: The data set is a small to medium sized key for 152 taxa, using 146 characters, 414 categorical state definitions, and 13252 values. It contains no quantitative or text characters. The revision of the dataset is not complete.
Data conversion: The export to SDD occurred indirectly, importing the original 3I database into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]] (converter available since version 2.0) and creating SDD from there. As a result, some details that could in principle be expressed in SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
Highlights for testing SDD: This dataset is provided as a fully multilingual dataset. Note that at the moment the natural language features are only partly exported in both languages; this is solely due to incomplete conversion, neither to 3I nor SDD.
Copyright and license: The "Key to Tribes of Leafhoppers" dataset attached here is © 2003-2006 D. Dmitriev & C. Dietrich. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5.
SDD document: [[%ATTACHURL%/Cicad.sdd11.zip][Cicad.sdd11.zip]]: D.Dmitriev's English/Russian multilingual example data set.
----
2. Data sets for natural language descriptions including markup
(None at the moment, please help us providing such a data set!)
----
3. Data sets for branching (static dichotomous or polytomous) keys
Dichotomous key to higher plants from Val Rosandra (Italy)
This SDD dataset is an export of the [[http://www.dryades.eu][FRIDA]] key to the higher plants of the [[http://www.comune.san-dorligo-della-valle.ts.it/][Val Rosandra nature reserve]] in Italy. The original FRIDA key is available [[http://dbiodbs.units.it/carso/chiavi_pub21?sc=67][online]]. The dataset has been created as a prototype for more widespread adoption of SDD in the context of the [[http://www.keytonature.eu/][Key to Nature]] EU project.
Description of data set: The data set is a medium to large sized dichotomous key covering 1149 taxa in 1154 couplets (2308 leads). 1949 images are linked into the key. The dichotomous key itself is fully translated to English. It key contains a single inner reticulation (where a lead can be reached by multiple paths) and many "terminal reticulations", i.e. taxa that are keyed out multiple times. It also contains 400 Italian natural language descriptions. In addition to the real FRIDA key, the dataset contains a second dummy key, to illustrate two points: a) a dataset may have multiple labeled keys, b) the optional question/answer style available in SDD.
Data conversion: The dataset is semi-manual prototype export from the FRIDA database. It is planned that the export routine will be fully automatized and that all available FRIDA keys will in the future be also available in SDD format.
Copyright and license: The "Val Rosandra" dataset attached here is © 2008 P.L. Nimis & S. Martellos. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license (Creative Commons 3.0 NC-BY-SA unported).
SDD document: [[%ATTACHURL%/Val-Rosandra-FRIDA-Key.sdd11.zip][Val-Rosandra-FRIDA-Key.sdd11.zip]]: Dichotomous key to higher plants from Val Rosandra (Italy).
Key to Dutch reptiles and amphibians (by ETI)
The dataset has been created as a prototype while implementing SDD in the ETI BioInformatics mobile key created in the context of the [[http://www.keytonature.eu/][Key to Nature]] EU project. Its goal is to create a small, but realistic identification dataset for testing purposes, combining several features of SDD.
Description of data set: The taxon names here contain atomized data (CanonicalName; this is the only dataset that features this), the key is dual language in Dutch and English. The key contains only categorical characters (no quantitative or text). The characters are labeled in question style, with the states giving the answers. Each taxon has a short Natural Language description (plain text without semantic markup; note that the English text is not a fully reflection of the Dutch). The key contains both coded descriptions to use with a multi-access key, and a manually created, fixed single-access key (polytomous). The latter in part uses question/answer style ("Does it have legs? yes/no"), in part couplet style with leads ("Warty skin, pupil horizontal/Warty skin, pupil vertical/Smooth skin, pupil vertical"). The size of the data set is small, with 24 taxa and 20 characters.
Data conversion: The dataset is semi-manual prototype export from ETI data.
Copyright and license: The dataset attached here is © 2008 ETI. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license (Creative Commons 3.0 NC-BY-SA unported).
SDD document: [[%ATTACHURL%/ETI_rept_amph_key.sdd11.xml.zip][ETI_rept_amph_key.sdd11.xml.zip]]: Key to Dutch reptiles and amphibians (by ETI)
----
-- Main.GregorHagedorn - 20 Nov 2008
%META:FILEATTACHMENT{name="LIAS_Main.sdd11.zip" attachment="LIAS_Main.sdd11.zip" attr="h" comment="LIAS Main dataset in SDD 1.1 format" date="1192533914" path="LIAS_Main.sdd11.zip" size="1107994" stream="LIAS_Main.sdd11.zip" user="Main.GregorHagedorn" version="1"}%
%META:FILEATTACHMENT{name="Erythroneura.sdd11.zip" attachment="Erythroneura.sdd11.zip" attr="h" comment="D.Dmitriev's Erythroneura key in SDD 1.1 format" date="1206721585" path="Erythroneura.sdd11.zip" size="110123" stream="Erythroneura.sdd11.zip" user="Main.GregorHagedorn" version="3"}%
%META:FILEATTACHMENT{name="Cicad.sdd11.zip" attachment="Cicad.sdd11.zip" attr="h" comment="D.Dmitriev's Russian/English key to Tribes of Leafhoppers in SDD 1.1" date="1192555525" path="Cicad.sdd11.zip" size="64791" stream="Cicad.sdd11.zip" user="Main.GregorHagedorn" version="1"}%
%META:FILEATTACHMENT{name="Val-Rosandra-FRIDA-Key.sdd11.zip" attachment="Val-Rosandra-FRIDA-Key.sdd11.zip" attr="h" comment="Dichotomous key to higher plants from Val Rosandra (Italy)" date="1212871529" path="Val-Rosandra-FRIDA-Key.sdd11.zip" size="250889" stream="Val-Rosandra-FRIDA-Key.sdd11.zip" user="Main.GregorHagedorn" version="4"}%
%META:FILEATTACHMENT{name="ETI_rept_amph_key.sdd11.xml.zip" attachment="ETI_rept_amph_key.sdd11.xml.zip" attr="h" comment="Key to Dutch reptiles and amphibians in SDD 1.1 format" date="1227202363" path="ETI_rept_amph_key.sdd11.xml.zip" size="15596" stream="ETI_rept_amph_key.sdd11.xml.zip" user="Main.GregorHagedorn" version="1"}%
@
1.16
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GarryJolleyRogers" date="1259118875" format="1.1" version="1.16"}%
d20 2
@
1.15
log
@none
@
text
@d1 3
a3 3
%META:TOPICINFO{author="LeeBelbin" date="1258685128" format="1.1" reprev="1.15" version="1.15"}%
%META:TOPICPARENT{name="BDI.SDD"}%
Real world examples (BDI.SDD 1.1)
d5 1
a5 1
To help application developers in understanding BDI.SDD and testing their software, we provide a number of "real world" example data sets for BDI.SDD, version 1.1. We try to provide different data sets with different properties and originating from different applications.
d11 1
a11 1
Introduction: The following two datasets are either distributed together with the CSIRO DELTA programs, or used in feature comparisons. They are provided here to help people with a DELTA background to understand the relation between BDI.SDD and DELTA.
d15 1
a15 1
Data conversion: Both datasets were initially converted by importing the DELTA data into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions 2.0 beta 10]] and exported to BDI.SDD 1.1 from there. Since the BDI.SDD produced in this way contains various DiversityDescriptions-specific information, the datasets were slightly cleaned by hand afterwards.
d19 3
a21 3
BDI.SDD documents:
* [[%ATTACHURL%/Beetles.sdd11.zip][Beetles.sdd11.zip]]: Beetles dataset in BDI.SDD 1.1 format
* [[%ATTACHURL%/DELTA2000-Sample.sdd11.zip][DELTA2000-Sample.sdd11.zip]]: DELTA.exe (2000) sample dataset on 14 Grass genera in BDI.SDD 1.1 format
d43 1
a43 1
Data conversion: The "LIAS main" dataset is managed in [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]]; the attached BDI.SDD 1.1 export file was created by the DiversityDescriptions export routine.
d45 1
a45 1
Highlights for testing BDI.SDD: The LIAS dataset is a large dataset and is especially suitable for testing the behavior of an application with large and rich keys.
d49 1
a49 1
BDI.SDD document: [[%ATTACHURL%/LIAS_Main.sdd11.zip][LIAS_Main.sdd11.zip]]: LIAS Main dataset in BDI.SDD 1.1 format
d55 1
a55 1
Introduction: The [[http://ctap.inhs.uiuc.edu/dmitriev/key.asp?key=Erythroneura&lng=En&i=1&keyN=2][Interactive Key to Species of the Genus Erythroneura (Homoptera, Cicadellidae)]] by D. Dmitriev & C. Dietrich is also available online under the [[http://ctap.inhs.uiuc.edu/dmitriev/][3I software]] created by Dmitry A. Dmitriev. 3I (Internet-accessible Interactive Identification) is a set of software tools for creating on-line identification keys, taxonomic databases, and virtual taxonomic revisions. By organizing illustrations and nomenclatural, morphological, bibliographical, and distributional data into a single database 3I also facilitates production of traditional, printed taxonomic papers and monographs. As such it is more comprehensive that BDI.SDD alone, pointing into the direction into which BDI.SDD plans to evolve (online monographs including nomenclature as well as descriptions and identification tools).
d59 1
a59 1
Data conversion: The export to BDI.SDD occurred indirectly, importing the original 3I database into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]] (converter available since version 2.0) and creating BDI.SDD from there. As a result, some details (specimen, nomenclature, publication data), which could in principle be expressed in BDI.SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
d61 1
a61 1
Highlights for testing BDI.SDD: The dataset is a small fully revised and published dataset with rich illustrations. Although the images are not included here, as of 2007-10-12 the given URLs were resolvable. Note: the dataset does not use any Status values ("unknown", "not applicable", etc.).
d63 1
a63 1
Copyright and license: The Erythroneura dataset attached here is © 2003-2006 D. Dmitriev & C. Dietrich. The BDI.SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5.
d65 1
a65 1
BDI.SDD document: [[%ATTACHURL%/Erythroneura.sdd11.zip][Erythroneura.sdd11.zip]]: D.Dmitriev's Erythroneura key in BDI.SDD 1.1 format
d75 1
a75 1
Data conversion: The export to BDI.SDD occurred indirectly, importing the original 3I database into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]] (converter available since version 2.0) and creating BDI.SDD from there. As a result, some details that could in principle be expressed in BDI.SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
d77 1
a77 1
Highlights for testing BDI.SDD: This dataset is provided as a fully multilingual dataset. Note that at the moment the natural language features are only partly exported in both languages; this is solely due to incomplete conversion, neither to 3I nor BDI.SDD.
d79 1
a79 1
Copyright and license: The "Key to Tribes of Leafhoppers" dataset attached here is © 2003-2006 D. Dmitriev & C. Dietrich. The BDI.SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5.
d81 1
a81 1
BDI.SDD document: [[%ATTACHURL%/Cicad.sdd11.zip][Cicad.sdd11.zip]]: D.Dmitriev's English/Russian multilingual example data set.
d95 1
a95 1
This BDI.SDD dataset is an export of the [[http://www.dryades.eu][FRIDA]] key to the higher plants of the [[http://www.comune.san-dorligo-della-valle.ts.it/][Val Rosandra nature reserve]] in Italy. The original FRIDA key is available [[http://dbiodbs.units.it/carso/chiavi_pub21?sc=67][online]]. The dataset has been created as a prototype for more widespread adoption of BDI.SDD in the context of the [[http://www.keytonature.eu/][Key to Nature]] EU project.
d97 1
a97 1
Description of data set: The data set is a medium to large sized dichotomous key covering 1149 taxa in 1154 couplets (2308 leads). 1949 images are linked into the key. The dichotomous key itself is fully translated to English. It key contains a single inner reticulation (where a lead can be reached by multiple paths) and many "terminal reticulations", i.e. taxa that are keyed out multiple times. It also contains 400 Italian natural language descriptions. In addition to the real FRIDA key, the dataset contains a second dummy key, to illustrate two points: a) a dataset may have multiple labeled keys, b) the optional question/answer style available in BDI.SDD.
d99 1
a99 1
Data conversion: The dataset is semi-manual prototype export from the FRIDA database. It is planned that the export routine will be fully automatized and that all available FRIDA keys will in the future be also available in BDI.SDD format.
d101 1
a101 1
Copyright and license: The "Val Rosandra" dataset attached here is © 2008 P.L. Nimis & S. Martellos. The BDI.SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license (Creative Commons 3.0 NC-BY-SA unported).
d103 1
a103 1
BDI.SDD document: [[%ATTACHURL%/Val-Rosandra-FRIDA-Key.sdd11.zip][Val-Rosandra-FRIDA-Key.sdd11.zip]]: Dichotomous key to higher plants from Val Rosandra (Italy).
d107 1
a107 1
The dataset has been created as a prototype while implementing BDI.SDD in the ETI BioInformatics mobile key created in the context of the [[http://www.keytonature.eu/][Key to Nature]] EU project. Its goal is to create a small, but realistic identification dataset for testing purposes, combining several features of BDI.SDD.
d113 1
a113 1
Copyright and license: The dataset attached here is © 2008 ETI. The BDI.SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license (Creative Commons 3.0 NC-BY-SA unported).
d115 1
a115 1
BDI.SDD document: [[%ATTACHURL%/ETI_rept_amph_key.sdd11.xml.zip][ETI_rept_amph_key.sdd11.xml.zip]]: Key to Dutch reptiles and amphibians (by ETI)
@
1.14
log
@none
@
text
@d1 3
a3 3
%META:TOPICINFO{author="GregorHagedorn" date="1227202967" format="1.1" reprev="1.14" version="1.14"}%
%META:TOPICPARENT{name="WebHome"}%
Real world examples (SDD 1.1)
d5 1
a5 1
To help application developers in understanding SDD and testing their software, we provide a number of "real world" example data sets for SDD, version 1.1. We try to provide different data sets with different properties and originating from different applications.
d11 1
a11 1
Introduction: The following two datasets are either distributed together with the CSIRO DELTA programs, or used in feature comparisons. They are provided here to help people with a DELTA background to understand the relation between SDD and DELTA.
d15 1
a15 1
Data conversion: Both datasets were initially converted by importing the DELTA data into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions 2.0 beta 10]] and exported to SDD 1.1 from there. Since the SDD produced in this way contains various DiversityDescriptions-specific information, the datasets were slightly cleaned by hand afterwards.
d19 3
a21 3
SDD documents:
* [[%ATTACHURL%/Beetles.sdd11.zip][Beetles.sdd11.zip]]: Beetles dataset in SDD 1.1 format
* [[%ATTACHURL%/DELTA2000-Sample.sdd11.zip][DELTA2000-Sample.sdd11.zip]]: DELTA.exe (2000) sample dataset on 14 Grass genera in SDD 1.1 format
d43 1
a43 1
Data conversion: The "LIAS main" dataset is managed in [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]]; the attached SDD 1.1 export file was created by the DiversityDescriptions export routine.
d45 1
a45 1
Highlights for testing SDD: The LIAS dataset is a large dataset and is especially suitable for testing the behavior of an application with large and rich keys.
d49 1
a49 1
SDD document: [[%ATTACHURL%/LIAS_Main.sdd11.zip][LIAS_Main.sdd11.zip]]: LIAS Main dataset in SDD 1.1 format
d55 1
a55 1
Introduction: The [[http://ctap.inhs.uiuc.edu/dmitriev/key.asp?key=Erythroneura&lng=En&i=1&keyN=2][Interactive Key to Species of the Genus Erythroneura (Homoptera, Cicadellidae)]] by D. Dmitriev & C. Dietrich is also available online under the [[http://ctap.inhs.uiuc.edu/dmitriev/][3I software]] created by Dmitry A. Dmitriev. 3I (Internet-accessible Interactive Identification) is a set of software tools for creating on-line identification keys, taxonomic databases, and virtual taxonomic revisions. By organizing illustrations and nomenclatural, morphological, bibliographical, and distributional data into a single database 3I also facilitates production of traditional, printed taxonomic papers and monographs. As such it is more comprehensive that SDD alone, pointing into the direction into which SDD plans to evolve (online monographs including nomenclature as well as descriptions and identification tools).
d59 1
a59 1
Data conversion: The export to SDD occurred indirectly, importing the original 3I database into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]] (converter available since version 2.0) and creating SDD from there. As a result, some details (specimen, nomenclature, publication data), which could in principle be expressed in SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
d61 1
a61 1
Highlights for testing SDD: The dataset is a small fully revised and published dataset with rich illustrations. Although the images are not included here, as of 2007-10-12 the given URLs were resolvable. Note: the dataset does not use any Status values ("unknown", "not applicable", etc.).
d63 1
a63 1
Copyright and license: The Erythroneura dataset attached here is © 2003-2006 D. Dmitriev & C. Dietrich. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5.
d65 1
a65 1
SDD document: [[%ATTACHURL%/Erythroneura.sdd11.zip][Erythroneura.sdd11.zip]]: D.Dmitriev's Erythroneura key in SDD 1.1 format
d75 1
a75 1
Data conversion: The export to SDD occurred indirectly, importing the original 3I database into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]] (converter available since version 2.0) and creating SDD from there. As a result, some details that could in principle be expressed in SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
d77 1
a77 1
Highlights for testing SDD: This dataset is provided as a fully multilingual dataset. Note that at the moment the natural language features are only partly exported in both languages; this is solely due to incomplete conversion, neither to 3I nor SDD.
d79 1
a79 1
Copyright and license: The "Key to Tribes of Leafhoppers" dataset attached here is © 2003-2006 D. Dmitriev & C. Dietrich. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5.
d81 1
a81 1
SDD document: [[%ATTACHURL%/Cicad.sdd11.zip][Cicad.sdd11.zip]]: D.Dmitriev's English/Russian multilingual example data set.
d95 1
a95 1
This SDD dataset is an export of the [[http://www.dryades.eu][FRIDA]] key to the higher plants of the [[http://www.comune.san-dorligo-della-valle.ts.it/][Val Rosandra nature reserve]] in Italy. The original FRIDA key is available [[http://dbiodbs.units.it/carso/chiavi_pub21?sc=67][online]]. The dataset has been created as a prototype for more widespread adoption of SDD in the context of the [[http://www.keytonature.eu/][Key to Nature]] EU project.
d97 1
a97 1
Description of data set: The data set is a medium to large sized dichotomous key covering 1149 taxa in 1154 couplets (2308 leads). 1949 images are linked into the key. The dichotomous key itself is fully translated to English. It key contains a single inner reticulation (where a lead can be reached by multiple paths) and many "terminal reticulations", i.e. taxa that are keyed out multiple times. It also contains 400 Italian natural language descriptions. In addition to the real FRIDA key, the dataset contains a second dummy key, to illustrate two points: a) a dataset may have multiple labeled keys, b) the optional question/answer style available in SDD.
d99 1
a99 1
Data conversion: The dataset is semi-manual prototype export from the FRIDA database. It is planned that the export routine will be fully automatized and that all available FRIDA keys will in the future be also available in SDD format.
d101 1
a101 1
Copyright and license: The "Val Rosandra" dataset attached here is © 2008 P.L. Nimis & S. Martellos. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license (Creative Commons 3.0 NC-BY-SA unported).
d103 1
a103 1
SDD document: [[%ATTACHURL%/Val-Rosandra-FRIDA-Key.sdd11.zip][Val-Rosandra-FRIDA-Key.sdd11.zip]]: Dichotomous key to higher plants from Val Rosandra (Italy).
d107 1
a107 1
The dataset has been created as a prototype while implementing SDD in the ETI BioInformatics mobile key created in the context of the [[http://www.keytonature.eu/][Key to Nature]] EU project. Its goal is to create a small, but realistic identification dataset for testing purposes, combining several features of SDD.
d113 1
a113 1
Copyright and license: The dataset attached here is © 2008 ETI. The SDD version is released here under the Creative Commons non-commercial, by attribution, share-alike license (Creative Commons 3.0 NC-BY-SA unported).
d115 1
a115 1
SDD document: [[%ATTACHURL%/ETI_rept_amph_key.sdd11.xml.zip][ETI_rept_amph_key.sdd11.xml.zip]]: Key to Dutch reptiles and amphibians (by ETI)
@
1.13
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1212871529" format="1.1" version="1.13"}%
d95 1
a95 1
This SDD dataset is an export of the [[http://www.dryades.eu][FRIDA]] key to the higher plants of the [[http://www.comune.san-dorligo-della-valle.ts.it/][Val Rosandra nature reserve]] in Italy. The original FRIDA key is available [[http://dbiodbs.units.it/carso/chiavi_pub21?sc=67][online]]. The dataset has been created as a prototype for more widespread adoption of SDD in the context of the [[http://www.keytonature.eu/en/index.html][Key to Nature]] EU project.
d105 13
d120 1
a120 1
-- Main.GregorHagedorn - 28 April 2008
d126 1
@
1.12
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1212598397" format="1.1" version="1.12"}%
d112 1
a112 1
%META:FILEATTACHMENT{name="Val-Rosandra-FRIDA-Key.sdd11.zip" attachment="Val-Rosandra-FRIDA-Key.sdd11.zip" attr="h" comment="Dichotomous key to higher plants from Val Rosandra (Italy)" date="1212598396" path="Val-Rosandra-FRIDA-Key.sdd11.zip" size="250993" stream="Val-Rosandra-FRIDA-Key.sdd11.zip" user="Main.GregorHagedorn" version="3"}%
@
1.11
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1210763678" format="1.1" version="1.11"}%
d112 1
a112 1
%META:FILEATTACHMENT{name="Val-Rosandra-FRIDA-Key.sdd11.zip" attachment="Val-Rosandra-FRIDA-Key.sdd11.zip" attr="h" comment="Dichotomous key to higher plants from Val Rosandra (Italy)" date="1209597850" path="Val-Rosandra-FRIDA-Key.sdd11.zip" size="250755" stream="Val-Rosandra-FRIDA-Key.sdd11.zip" user="Main.GregorHagedorn" version="2"}%
@
1.10
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1209597850" format="1.1" version="1.10"}%
d9 18
@
1.9
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1209475627" format="1.1" version="1.9"}%
d94 1
a94 1
%META:FILEATTACHMENT{name="Val-Rosandra-FRIDA-Key.sdd11.zip" attachment="Val-Rosandra-FRIDA-Key.sdd11.zip" attr="h" comment="Dichotomous key to higher plants from Val Rosandra (Italy)" date="1209412039" path="Val-Rosandra-FRIDA-Key.sdd11.zip" size="252787" stream="Val-Rosandra-FRIDA-Key.sdd11.zip" user="Main.GregorHagedorn" version="1"}%
@
1.8
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1209412910" format="1.1" reprev="1.8" version="1.8"}%
d79 1
a79 1
Description of data set: The data set is a medium to large sized dichotomous key covering 1149 taxa in 1154 couplets (2308 leads). 1949 images are linked into the key. The dichotomous key itself is fully translated to English. It key contains a single inner reticulation (where a lead can be reached by multiple paths) and many "terminal reticulations", i.e. taxa that are keyed out multiple times. It also contains 400 Italian natural language descriptions.
@
1.7
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1206721714" format="1.1" reprev="1.7" version="1.7"}%
d75 11
a85 1
(None at the moment, please help us providing such a data set!)
d89 1
a89 1
-- Main.GregorHagedorn - 16 Oct 2007
d94 1
@
1.6
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1192555557" format="1.1" reprev="1.6" version="1.6"}%
d43 1
a43 1
Highlights for testing SDD: The dataset is a small fully revised and published dataset with rich illustrations. Although the images are not included here, as of 2007-10-12 the given URLs were resolvable.
a50 5
Cicad
----
d82 1
a82 1
%META:FILEATTACHMENT{name="Erythroneura.sdd11.zip" attachment="Erythroneura.sdd11.zip" attr="h" comment="D.Dmitriev's Erythroneura key in SDD 1.1 format" date="1192553251" path="Erythroneura.sdd11.zip" size="110209" stream="Erythroneura.sdd11.zip" user="Main.GregorHagedorn" version="2"}%
@
1.5
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1192536121" format="1.1" reprev="1.5" version="1.5"}%
a34 1
d47 22
a68 1
SDD document: [[%ATTACHURL%/Erythroneura.sdd11.zip][Erythroneura.sdd11.zip]]: D.Dmitrievs Erythroneura key in SDD 1.1 format
d87 2
a88 1
%META:FILEATTACHMENT{name="Erythroneura.sdd11.zip" attachment="Erythroneura.sdd11.zip" attr="h" comment="D.Dmitrievs Erythroneura key in SDD 1.1 format" date="1192533962" path="Erythroneura.sdd11.zip" size="100658" stream="Erythroneura.sdd11.zip" user="Main.GregorHagedorn" version="1"}%
@
1.4
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1192281190" format="1.1" reprev="1.4" version="1.4"}%
d11 1
a11 3
Introduction: [[http://www.lias.net/][LIAS]] is a global information system for Lichenized and Non-Lichenized Ascomycetes. The vision of LIAS is to establish a non-commercial global information system for the collection and distribution of descriptive, phylogenetic, and other biodiversity data on these taxonomic groups that uses advanced technology and where published biodiversity data of all ascomycetes are joined in a multi-authored database and used for the most sophisticated queries. Specific goals are to
d19 1
a19 6
LIAS is the work of many collaborators. The primary editors are G. Rambold and D. Triebel. Cooperating Institutions are the University of Bayreuth, Department of Mycology,
Botanische Staatssammlung München,
University of Hamburg, Herbarium Hamburgense,
Arizona State University, Department of Plant Biology, and the
University of Oslo, Botanical Museum. It is or was supported by funds from the Bayerisches Staatsministerium für Wissenschaft, Forschung und Kunst, Bundesministerium für Bildung und Wissenschaft (BMBF), Deutsche Forschungsgemeinschaft (DFG), and
Staatliche Naturwissenschaftliche Sammlungen Bayerns.
d21 1
a21 1
Description of data set: The data set provided here is the complete [[http://www.lias.net/][LIAS]] main data set as of 2007-07. It provides descriptions of 2480 genera and species of lichens using 987 characters with a total of 7632 categorical state definitions (plus 3128 status values or statistical measures for quantitative characters). The descriptions are atomized to a total of 221821 values. Only relatively few characters and states are "pseudo-" or "management-characters", dealing with taxonomy, revision management, etc. Of the total LIAS main character data matrix of 2480x987 = 2447760 cells, 157041 cells are filled (6.4%). Part of this low fill factor is due to the taxonomic diversity encompassed in the data set, but it also shows that significant work still has to be done.
d23 1
a23 1
Related data sets: Two datasets are closely related with LIAS main: (1) [[http://liaslight.lias.net/][LIAS light]] contains fewer characters but has been more extensively revised and has a higher fill factor. It is therefore more suitable for practical identification. The editors Dr.. Rambold and Triebel are willing to make these data available in a similar manner to LIAS main, but need permission from collaborators to do so. (2) A key to 679 powdery mildews (Erysiphales), which for reasons convenience is coupled with LIAS main, has been excluded from this release.
d27 1
a27 1
Highlights for testing SDD: The LIAS dataset is a large dataset, especially suitable for testing the behavior of an application with large and rich keys.
d31 1
a31 2
####FILE YET MISSING, TO BE UPLOADED SOON###
d48 1
a48 1
####FILE YET MISSING, TO BE UPLOADED SOON###
d64 4
a67 1
-- Main.GregorHagedorn - 12 Oct 2007
@
1.3
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1192204225" format="1.1" reprev="1.3" version="1.3"}%
d46 1
a46 5
Introduction: [[ctap.inhs.uiuc.edu/dmitriev/key.asp?key=Erythroneura&lng=En&i=1&keyN=2][The Interactive Key to Species of Erythroneura]] by D. Dmitriev & C. Dietrich is available online under the [[http://ctap.inhs.uiuc.edu/dmitriev/index.asp][3I software]] created by Dmitry A. Dmitriev. 3I (Internet-accessible Interactive Identification) is a set of software tools for creating on-line identification keys, taxonomic databases, and virtual taxonomic revisions. By organizing illustrations and nomenclatural, morphological, bibliographical, and distributional data into a single database 3I also facilitates production of traditional, printed taxonomic papers and monographs. As such it is slightly more comprehensive that SDD alone, pointing into the direction into which SDD plans to evolve (online monographs including nomenclature as well as descriptions and identification tools).
d50 1
a50 1
Data conversion: The export to SDD occurred indirectly, importing the original 3I database into [[http://www.diversityworkbench.net/Portal/wiki/DiversityDescriptions][DiversityDescriptions]] and creating SDD from there. As a result, some details (specimen, nomenclature, publication data), which could in principle be expressed in SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
@
1.2
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1192198913" format="1.1" version="1.2"}%
d11 1
a11 1
d13 13
a25 3
Introduction: [[http://www.lias.net/][LIAS]] is a global information system for Lichenized and Non-Lichenized Ascomycetes, a distributed internet project containing information about phylogeny and biodiversity.
LIAS is the work of many collaborators. The primary editors are G. Rambold and D. Triebel. It is supported by funds from the Bayerisches Staatsministerium für Wissenschaft, Forschung und Kunst, Bundesministerium für Bildung und Wissenschaft (BMBF), Deutsche Forschungsgemeinschaft (DFG), and
d28 1
a28 1
Description of data set: The data set provided here is the complete [[http://www.lias.net/][LIAS]] main data set as of 2007-07 providing descriptions of 2480 genera and species of lichens, using 987 characters with a total of 7632 categorical state definitions (plus 3128 status values or statistical measures for quantitative characters). The descriptions are atomized to a total of 221821 values. Only relatively few characters and states are "pseudo-" or "management-characters", dealing with taxonomy, revision management, etc. Of the total LIAS main character data matrix of 2480x987 = 2447760 cells, 157041 cells are filled (6.4%). Part of this low fill factor is due to the taxonomic diversity encompassed in the data set, but it also shows that significant work still has to be done.
d32 1
a32 1
Data conversion: The "LIAS main" dataset is managed in DiversityDescriptions; the attached SDD 1.1 export file was created by the DiversityDescriptions export routine.
d36 4
a39 1
Copyright and license: The "LIAS main" dataset attached here is © 1996–2007 by Botanische Staatssammlung München. All rights reserved. It is here released under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5. Further details are included in the file itself.
a42 1
An Interactive Key to Species of Erythroneura
d44 3
a46 1
d48 1
d50 1
a50 1
Introduction: [[http://www.lias.net/][LIAS]] is
d52 1
a52 1
D. Dmitriev & C. Dietrich
d54 1
a54 1
Description of data set: The data set provided here is the complete [[http://www.lias.net/][LIAS]] main data set as of 2007-07 providing descriptions of 2480 genera and species of lichens, using 987 characters with a total of 7632 categorical state definitions (plus 3128 status values or statistical measures for quantitative characters). The descriptions are atomized to a total of 221821 values. Only relatively few characters and states are "pseudo-" or "management-characters", dealing with taxonomy, revision management, etc. Of the total LIAS main character data matrix of 2480x987 = 2447760 cells, 157041 cells are filled (6.4%). Part of this low fill factor is due to the taxonomic diversity encompassed in the data set, but it also shows that significant work still has to be done.
d56 1
a56 1
Data conversion: The conversion to SDD occurred indirectly, import the original 3I database into DiversityDescriptions and creating SDD from there. As a result, some details of the data (specimen, nomenclature, publications) that are included in the original version, and which could in principle be expressed in SDD 1.1, were lost because they were not fully supported by DiversityDescriptions.
d58 1
a58 1
Highlights for testing SDD: The dataset is a fully revised, dataset that is published large dataset with referenced images. The images are, however, not included in the current release.
d60 1
a60 1
Copyright and license: The Erythroneura dataset attached here is © 2003-2006 D. Dmitriev & C. Dietrich. It is here released under the Creative Commons non-commercial, by attribution, share-alike license in version 2.5.
d76 1
a76 1
-- Main.GregorHagedorn - 12 Oct 2007@
1.1
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="GregorHagedorn" date="1192188865" format="1.1" version="1.1"}%
d5 1
a5 1
To help application developers in testing their software for
d7 1
d13 1
a13 1
[[http://www.lias.net/][LIAS]] is a global information system for Lichenized and Non-Lichenized Ascomycetes, a distributed internet project containing information about phylogeny and biodiversity.
d22 1
a22 1
Highlights of LIAS main for testing SDD:
d24 1
d26 1
a26 1
Data conversion: The LIAS dataset is managed in DiversityDescriptions and the SDD 1.1 export attached is created by the DiversityDescriptions export routine.
d28 32
@