head 1.14; access; symbols; locks; strict; 1.14 date 2009.06.13.09.09.17; author GregorHagedorn; state Exp; branches; next 1.13; 1.13 date 2006.09.07.02.06.06; author DonovanSharp; state Exp; branches; next 1.12; 1.12 date 2006.09.07.00.50.03; author DonovanSharp; state Exp; branches; next 1.11; 1.11 date 2006.08.29.03.03.55; author DonovanSharp; state Exp; branches; next 1.10; 1.10 date 2006.07.10.01.41.41; author DonovanSharp; state Exp; branches; next 1.9; 1.9 date 2006.07.09.04.37.51; author DonovanSharp; state Exp; branches; next 1.8; 1.8 date 2006.07.08.05.16.39; author DonovanSharp; state Exp; branches; next 1.7; 1.7 date 2006.07.07.02.20.40; author DonovanSharp; state Exp; branches; next 1.6; 1.6 date 2006.07.06.05.48.50; author DonovanSharp; state Exp; branches; next 1.5; 1.5 date 2006.07.05.07.30.49; author DonovanSharp; state Exp; branches; next 1.4; 1.4 date 2006.07.05.02.18.30; author DonovanSharp; state Exp; branches; next 1.3; 1.3 date 2006.06.08.04.27.56; author DonovanSharp; state Exp; branches; next 1.2; 1.2 date 2006.06.02.05.00.02; author DonovanSharp; state Exp; branches; next 1.1; 1.1 date 2006.06.01.05.40.38; author DonovanSharp; state Exp; branches; next ; desc @@ 1.14 log @none @ text @%META:TOPICINFO{author="GregorHagedorn" date="1244884157" format="1.1" version="1.14"}% %META:TOPICPARENT{name="SddContents"}% ---++SDD Part 0: Introduction and Primer to the SDD Standard ---+++2.3 Natural language descriptions. ---+++2.3.1 Traditional natural language descriptions. Natural-language descriptions (Box 2.2.1) are semi-structured, semi-formalised descriptions of a taxon (or occasionally of an individual specimen). They may be simple, short and written in plain language (if used for a popular field guide), or long, highly formal and using specialised terminology when used in a taxonomic monograph or other treatment. ---++++Box 2.3.1 - Typical natural language descriptions

Red Knot (Calidris canutus)
Stout wader with bill same length as head, crown unstreaked, narrow white bar in wing, pale rump with grey barring, shortish olive legs. Non-breeding: grey above with narrow pale edging to feathers, pale eyebrow, smudged sides to neck with faint spotting. Juvenile: feathers of back edged white with dark subterminal bar, breast more heavily spotted pale buff and flanks barred, crown faintly streaked. Breeding: rufous underparts, feathers of back rufous patterned with black. Voice: 'knut-knut', `nyui , high-pitched `toowit-wit'.

from Slater, P., Slater, P. & Slater, R. (2001) The Slater Field Guide to Australian Birds  (Reed New Holland: Sydney)

Discaria pubescens (Brongn.) Druce
Rigid, spreading shrub to c. 1 m high and wide; stems glabrous. Leaves soon deciduous, c. oblong, to 10 mm long, 3 mm wide, obtuse or minutely mucronate within an apical notch, margins minutely toothed, surfaces glabrous or a few hairs present near tip; stipules dark reddish-brown, c. 1 mm long, often shallowly joined around the node, pubescent on inner face; spines stout, 1.5-4 cm long. Flowers white, solitary or in few-flowered axillary cymes, sometimes congested on short apical shoots; pedicels 2-3 mm long; hypanthium c. 1.5 mm long; sepals somewhat spreading, 1-1.5 mm long; petals attached at throat of hypanthium, c. 1 mm long; stamens subequal to and weakly hooded by petals; disc prominent, lining base of hypanthium, obscurely 5-angled; style minute. Capsule prominently 3-lobed, 4-5 mm diam., the valves separating incompletely at maturity and splitting dorsally and medially.

from Walsh, N.G. (1999) Rhamnaceae, in N.G.Walsh & T.J.Entwisle, Flora of Victoria Volume 4, Dicotyledons, Cornaceae to Asteraceae (Inkata Press: Melbourne)

There are two methods for the production of natural language descriptions within SDD. * Descriptions may be produced elsewhere and simply stored within an SDD instance document, these are "authored natural language descriptions" * Descriptions may be generated from data and text snippets sourced from within the SDD instance document, these are termed "marked up natural language descriptions". ---+++2.3.2 Authored natural language descriptions. Authored natural language descriptions are simply descriptions written by hand, either within an application or imported into an application, including legacy descriptions sourced from existing publications. Within SDD "authored" descriptions may never be overwritten by a natural language reporting process, whereas "generated" descriptions may be updated. Both "authored" and generated descriptions may contain markup (data supplied from a coded data source) but this is not required. All natural language descriptions are nested within the <NaturalLanguageDescriptions> element within [[SddDatasets][<Dataset>]]. A natural language description requires only two essential items: the names of the taxa being described, and the descriptions themselves. A simple SDD instance document for natural language descriptions has the basic structure shown below and in Example 2.3.2. %ATTACHURL%/authoreddescriptions.gif ---++++Example 2.3.2 - Anchored natural language descriptions
Herbs, shrubs; or trees, monoecious or rarely dioecious. Leaves alternate, margins usually dentate or crenate. Flowers small, males and females in separate axillary spikes or females solitary in separate axils or one or more at or near base of male spikes; male flowers clustered in axillary spikes with small bract under each cluster, perianth of 4 segments, glands absent, stamens 8 or rarely 8-16 inserted on a raised central receptacle, filaments free; female flowers 1-4 together within a leafy bract, bracts solitary or in spikes, perianth of 3 segments, rarely 4, styles distinct, finely branched. Fruits capsules. Herb up to 30 cm tall, stems and leaves often pink or red. Leaves with petioles 1-2 cm long; blades ovate or subrhomboid, apex acuminate, base acute or obtuse, margin serrulate-crenate, 2-6 cm X 1-3.5 cm. Spikes short, 1-3 per axil, peduncles ca 0.5-1 cm long or longer; bracts up to ca 1.5 cm long.
For more information on defining taxon names using the <TaxonNames> element, see the topic [[TaxonNames][Defining taxon names]]. Note that taxa can also be arranged into hierarchies. See the topic [[TaxonHierarchies][Defining taxon hierarchies]] for more information. The <Representation> element provides a label for the description. This may be useful if the instance document includes multiple descriptions for different purposes. <Scope> describes the taxon or set of taxa to which the description applies. The <NaturalLanguageData> element contains the text of the natural language description. ---+++2.3.3 Marked up natural language descriptions. %ATTACHURL%/naturallanguagemarkup.gif Marking up of natural language descriptions allows parsing of matrix data into natural language descriptions and modification of character and state names for inclusion in natural language descriptions. "Authored" descriptions may never be overwritten by a natural language reporting process, whereas "generated" descriptions may be updated. Both "authored" and "generated" descriptions may have markup, but do not need to. The sdd standard is capable of storing data with partial markup, resulting from any mixture of automatic markup by a processor or manual markup. -- Main.DonovanSharp - 01 Jun 2006 * Long expanded version of NLD structures:
NaturalLanguageMarkupLong.png %META:FILEATTACHMENT{name="naturallanguagemarkup.gif" attr="" autoattached="1" comment="" date="1152238734" path="naturallanguagemarkup.gif" size="18308" user="Main.DonovanSharp" version="1"}% %META:FILEATTACHMENT{name="authoreddescriptions.gif" attr="" autoattached="1" comment="" date="1152495701" path="authoreddescriptions.gif" size="10786" user="Main.DonovanSharp" version="6"}% %META:FILEATTACHMENT{name="naturallanguage.gif" attr="" autoattached="1" comment="" date="1152238700" path="naturallanguage.gif" size="7501" user="Main.DonovanSharp" version="1"}% %META:FILEATTACHMENT{name="NaturalLanguageMarkupLong.png" attachment="NaturalLanguageMarkupLong.png" attr="h" comment="Long expanded version of NLD structures" date="1244884156" path="NaturalLanguageMarkupLong.png" size="51612" stream="NaturalLanguageMarkupLong.png" user="Main.GregorHagedorn" version="1"}% @ 1.13 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1157594766" format="1.1" version="1.13"}% d134 3 d140 1 @ 1.12 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1157590203" format="1.1" reprev="1.12" version="1.12"}% d129 1 a129 1 Marking up of natural language descriptions allows parsing of matrix data into natural language descriptions and modification of character and state names for inclusion in natural language descriptions. @ 1.11 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1156820635" format="1.1" reprev="1.11" version="1.11"}% d129 1 a129 1 no content yet @ 1.10 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1152495701" format="1.1" reprev="1.10" version="1.10"}% d5 2 a6 2 ---+++2.2 Natural language descriptions. ---++++2.2.1 Traditional natural language descriptions. d10 1 d14 1 a14 1 Box 2.2.1 - Typical natural language descriptions d55 1 a55 1 ---+++2.2.2 Authored natural language descriptions. d61 1 a61 1 A simple SDD instance document for natural language descriptions has the basic structure shown below and in Example 2.1.1. d65 1 a65 1 ---++++Example 2.2.2 - Anchored natural language descriptions d125 1 a125 1 ---+++2.2.3 Marked up natural language descriptions. d135 1 a135 1 %META:FILEATTACHMENT{name="authoreddescriptions.gif" attachment="authoreddescriptions.gif" attr="" comment="" date="1152495701" path="authoreddescriptions.gif" size="10786" stream="authoreddescriptions.gif" user="Main.DonovanSharp" version="6"}% @ 1.9 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1152419871" format="1.1" reprev="1.9" version="1.9"}% d134 1 a134 1 %META:FILEATTACHMENT{name="authoreddescriptions.gif" attr="" autoattached="1" comment="" date="1152419774" path="authoreddescriptions.gif" size="11788" user="Main.DonovanSharp" version="4"}% @ 1.8 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1152335799" format="1.1" reprev="1.8" version="1.8"}% d51 1 a51 1 * Descriptions may be generated from data and text snippets sourced from within the SDD instance document, these are termed "marked up natural langusge descriptions". d58 4 d114 1 d116 8 d134 1 a134 1 %META:FILEATTACHMENT{name="authoreddescriptions.gif" attachment="authoreddescriptions.gif" attr="" comment="" date="1152335799" path="authoreddescriptions.gif" size="11008" stream="authoreddescriptions.gif" user="Main.DonovanSharp" version="3"}% @ 1.7 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1152238840" format="1.1" reprev="1.7" version="1.7"}% d48 1 d50 2 a51 1 Natural language descriptions are contained within the [[SddDatasets][<Datasets>]] element in SDD. a52 1 %ATTACHURL%/naturallanguage.gif a53 42 Within the <NaturalLanguageDescriptions> element are three element groups. Firstly a simple <Representation> to contain Authored natural language descriptions. Secondly a [[SddScope][<Scope>]] element to define the entities described (a taxon samples at two localities may have differing descriptions for example). Thirdly a <NaturalLanguageData> element to contain the data required to produce marked-up natural language descriptions ---++++Example 2.2.1 - Anchored natural language descriptions
... describes the generating application ... provides a list of the entities (usually taxa) described ... the description text (see below)
In SDD natural language descriptions may be authored or generated free-form descriptions, which may be completely or partially marked-up with elements similar to those in coded descriptions. d117 1 d68 10 a77 7 Herbs, shrubs; or trees, monoecious or rarely dioecious. d87 8 a94 8 Herb up to 30 cm tall, stems and leaves often pink or red. d99 4 a102 3 d121 1 a121 2 %META:FILEATTACHMENT{name="authoreddescriptions.gif" attachment="authoreddescriptions.gif" attr="" comment="" date="1152335111" path="authoreddescriptions.gif" size="10517" stream="authoreddescriptions.gif" user="Main.DonovanSharp" version="2"}% %META:FILEATTACHMENT{name="authoreddescriptions.gif" attr="" autoattached="1" comment="" date="1152238715" path="authoreddescriptions.gif" size="8948" user="Main.DonovanSharp" version="1"}% @ 1.6 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1152164930" format="1.1" reprev="1.6" version="1.6"}% d51 1 a51 1 %ATTACHURL%/naturallanguage.jpg d99 1 a99 1 %ATTACHURL%/authoreddescriptions.jpg d150 1 a150 1 %ATTACHURL%/naturallanguagemarkup.jpg d157 3 a159 3 %META:FILEATTACHMENT{name="authoreddescriptions.jpg" attr="" autoattached="1" comment="" date="1152083762" path="authoreddescriptions.jpg" size="28514" user="Main.DonovanSharp" version="1"}% %META:FILEATTACHMENT{name="naturallanguage.jpg" attr="" autoattached="1" comment="" date="1149740876" path="naturallanguage.jpg" size="16414" user="Main.DonovanSharp" version="1"}% %META:FILEATTACHMENT{name="naturallanguagemarkup.jpg" attr="" autoattached="1" comment="" date="1152084648" path="naturallanguagemarkup.jpg" size="27985" user="Main.DonovanSharp" version="3"}% @ 1.5 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1152084649" format="1.1" reprev="1.5" version="1.5"}% a56 2

    
d58 7
a64 1
   <Datasets>
d66 1
a66 1
     <TechnicalMetadata>
d68 1
a68 1
     </TechnicalMetadata>
d70 1
a70 1
       <Dataset>
d72 1
a72 1
         <TaxonNames>
d74 1
a74 1
         </TaxonNames>
d76 1
a76 1
           <NaturalLanguageDescriptions>
d78 1
a78 1
           </NaturalLanguageDescriptions>
d80 2
a81 2
       </Dataset>
   </Datasets>
d84 7
a90 2
	

d103 4 d142 2 d145 3 d159 1 a159 1 %META:FILEATTACHMENT{name="naturallanguagemarkup.jpg" attachment="naturallanguagemarkup.jpg" attr="" comment="" date="1152084648" path="naturallanguagemarkup.jpg" size="27985" stream="naturallanguagemarkup.jpg" user="Main.DonovanSharp" version="3"}% @ 1.4 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1152065910" format="1.1" reprev="1.4" version="1.4"}% d53 2 d88 1 a88 1 Authored natural language descriptions are simply descriptions written by hand, either eithin an application or imported into an application, including legacy descriptions sourced from existing publications. Within SDD "authored" descriptions may never be overwritten by a natural language reporting process, whereas "generated" descriptions may be updated. Both "authored" and generated descriptions may contain markup (data supplied from a coded data source) but this is not required. All natural language descriptions are nested within the <NaturalLanguageDescriptions> element within [[SddDatasets][<Dataset>]]. d90 2 d132 2 d139 3 a141 1 %META:FILEATTACHMENT{name="naturallanguage.jpg" attachment="naturallanguage.jpg" attr="" comment="" date="1149740875" path="naturallanguage.jpg" size="16414" stream="naturallanguage.jpg" user="Main.DonovanSharp" version="1"}% @ 1.3 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1149740876" format="1.1" reprev="1.3" version="1.3"}% d49 1 a49 1 Natural language descriptions are contained within the [[SddDatasets][Datasets]] element in SDD. d86 1 a86 1 Authored natural language descriptions are simply descriptions written by hand, either eithin an application or imported into an application, including legacy descriptions sourced from existing publications. Within SDD "authored" descriptions may never be overwritten by a natural language reporting process, whereas "generated" descriptions may be updated. Both "authored" and generated descriptions may contain markup (data supplied from a coded data source) but this is not required. All natural language descriptions are nested within the NaturalLanguageDescriptions tag within [[SddDatasets][Dataset]]. @ 1.2 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1149224402" format="1.1" reprev="1.2" version="1.2"}% d51 2 d132 2 @ 1.1 log @@ text @d1 1 a1 1 %META:TOPICINFO{author="DonovanSharp" date="1149140438" format="1.1" version="1.1"}% d3 1 d5 2 a6 1 SDD Part 0: Introduction and Primer to the SDD Standard a7 2 2.2 Natural language descriptions. 2.2.1 Traditional natural language descriptions. a9 3 Box 2.2.1 - Typical natural language descriptions Red Knot (Calidris canutus) Stout wader with bill same length as head, crown unstreaked, narrow white bar in wing, pale rump with grey barring, shortish olive legs. Non-breeding: grey above with narrow pale edging to feathers, pale eyebrow, smudged sides to neck with faint spotting. Juvenile: feathers of back edged white with dark subterminal bar, breast more heavily spotted pale buff and flanks barred, crown faintly streaked. Breeding: rufous underparts, feathers of back rufous patterned with black. Voice: 'knut-knut', `nyui , high-pitched `toowit-wit'. d11 36 a46 1 from Slater, P., Slater, P. & Slater, R. (2001) The Slater Field Guide to Australian Birds (Reed New Holland: Sydney) a47 2 Discaria pubescens (Brongn.) Druce Rigid, spreading shrub to c. 1 m high and wide; stems glabrous. Leaves soon deciduous, c. oblong, to 10 mm long, 3 mm wide, obtuse or minutely mucronate within an apical notch, margins minutely toothed, surfaces glabrous or a few hairs present near tip; stipules dark reddish-brown, c. 1 mm long, often shallowly joined around the node, pubescent on inner face; spines stout, 1.5-4 cm long. Flowers white, solitary or in few-flowered axillary cymes, sometimes congested on short apical shoots; pedicels 2-3 mm long; hypanthium c. 1.5 mm long; sepals somewhat spreading, 1-1.5 mm long; petals attached at throat of hypanthium, c. 1 mm long; stamens subequal to and weakly hooded by petals; disc prominent, lining base of hypanthium, obscurely 5-angled; style minute. Capsule prominently 3-lobed, 4-5 mm diam., the valves separating incompletely at maturity and splitting dorsally and medially. d49 1 d51 1 a51 1 from Walsh, N.G. (1999) Rhamnaceae, in N.G.Walsh & T.J.Entwisle, Flora of Victoria Volume 4, Dicotyledons, Cornaceae to Asteraceae (Inkata Press: Melbourne) d53 2 d56 1 a56 7 Natural language descriptions are contained within the element in SDD. d126 1 Example 2.2.1 - Anchored natural language descriptions d58 3 a60 3 ... describes the generating application, see (Technical Metadata) d62 1 a62 1 d64 3 a66 4 ... provides a list of the entities (usually taxa) described, see (Taxon names) d68 1 a68 1 d70 1 a70 1 d72 2 a73 2 d76 2 d82 1 a82 2 2.2.2 Authored natural language descriptions. Authored natural language descriptions are simply descriptions written by hand, either eithin an application or imported into an application, including legacy descriptions sourced from existing publications. Within SDD "authored" descriptions may never be overwritten by a natural language reporting process, whereas "generated" descriptions may be updated. Both "authored" and generated descriptions may contain markup (data supplied from a coded data source) but this is not required. All natural language descriptions are nested within the tag within . d84 1 a84 1 Example 2.2.2 - Anchored natural language descriptions d86 2 d89 1 a89 1 d121 1 a121 1 d124 2 a125 1 2.2.3 Marked up natural language descriptions. @