head 1.1; access; symbols; locks; strict; comment @# @; expand @b@; 1.1 date 2005.10.22.17.35.12; author BobMorris; state Exp; branches; next ; desc @none @ 1.1 log @Current IASPS schema @ text @ The root container for a IAS profile. Please note this is an early draft based on Mike Browne's spreadsheet and I am not a schema expert. Some elements flagged as placeholders are references to schemas defined elsewhere, eg ABCD/DC/UBIF etc. Some elements have been borrowed from ABCD2.05 and TCS where appropriate. Most elements are not 'typed' and I have not expanded enumerations. The general intent is that higher level elements act as free text containers but also have finer levels of atomization. Jerry Cooper 26/08/2005 A globally unique identifier for the entire data collection the present dataset was derived from. The exact format and/or semantics are still under discussion. DublinCore-like container for metadata describing the source of this entire profile Fact sheet data, derived from numerous sources, for a specific region, or general statements throughout range of taxon, ordinated by taxon/location one or more taxon/location/fact triplets. TCS GUID and/or locally stored elements derived from IAS dbs Should only be populated if this is a regional fact sheet element - see switch within Facts A placeholder for lists/occurrence data - best described within external schemas - ABCD/DC etc. pointers to DarwinCore/ABCD/StructuredSurvey data. Such data should be shared via DiGIR-DC or BioCASE/ABCD. Not sure about linkage of DC with reference here. Needs further consideration. ABCD Metadata element the sources used to build this profile. Lit references, web references, people. Against keyword to indicate profile component. This mechanims rather than many reference elemnts for each component. Linkage to TCS data with some minimal local caching. Note that source dbs containing detailed and authoritative taxonomic data should share that data via the TaxonConceptSchema. The GUID of the taxon concept - even if nominal concept - within an akcknowldeged taxon name catalogue - the ECATGUID (superset of ITIS/COL/Species2000/GSD/RSD s) Full name without author string Rank data should be derived from taxon db's via the GUID Critical to capture these data from IAS dbs Virus Bacteria Fungus Arachnid Insect Flatworm Nematode Mollusc Alga/seaweed Anemone Coral Comb jelly Jellyfish Crustacean Starfish Fish Amphibian Reptile Bird Mammal Aquatic plant Palm Tree Shrub Grass Rush Sedge Herb Vine, climber Bromeliad Fern Succulent Linkage to referenced data - not enumerated here URI of an external reference cached elements of a reference - not enumerated here. Structures exist elsewhere. External namespace? keywords to match profile component decriptors. Third party comment In its simplest form this is a URL. The image provider to ensure appropriate image metadata in the web form referenced. A TDWG group is being formed to deal with image metadata standards. Could be expanded here - structures exist elsewhere. Native or introduced terrestrial, fresh water,brackish, marine, Coastland Marine habitats Estuaries Lakes Watercourses Riparian zones Wetlands Urban areas Agricultural areas Disturbed areas Planted forests Natural forests Scrub/shrublands Range/grasslands Tundra Deserts Ice Host Vector generic container for multiple habitat definitions free text Think about including a lifecycle-stage option Interaction and distribution of hosts. Mechanims here is poor and will repeat identical host data for different taxa? Project or case study General or Local fact sheet? A fact sheet about a specific area 1 Absent 1.1 Recorded in error 1.2 Extinct 1.3 Eradicated 1.4 Border intercept 2 Reported 2.1 Established (able to survive) 2.2 Naturalised (reproducing) 2.4 In captivity/cultivated 2.5 Sometimes present 2.6 Present/controlled 3 Uncertain Alien Native (no further data) Native. Endemic Native. Non-endemic Not specified Biostatus uncertain Invasive Not invasive Not specified Uncertain same elements as 'Introduction' but different vocabulary. Better way to handle this? free text. Widespread, localised etc. free text.Where to put Population dynamic terms such as growth rate, age, stable, expanding, numerous, few, trend? c.f. distribution and occurrence If a 'general' fact sheet then higher level location elemnt does not need populating. Status of organims being impacted, e.g. rare, endangred, threatened. port, protected area etc Need to assure compliance with geospatial standards I think we need accuracy and uncertainty Extrenal references, e.g. to WMS providers. Probably a better way? What do FDGC do? think about relationship with requirements e.g. disease cycle Position in the food chain, determined by the number of energy-transfer steps to that level. Persistence, Capacity to establish and spread, Capable of securing and ingesting a wide range of food, Tolerant of physical conditions, Pioneering in disturbed or vacant habitats What triggers growth Where/when to look for it, how to attract/trap it, where to find a diagnostic key/expert to identify it, etc. how to collect it, who to send sample to, warning statements etc. Flag as post or pre Altered trophic level Changed gene pool Conflict Damaged ecosystem services Ecosystem change Habitat alteration Host damage Increases vulnerability to invasions Infrastructure damage Loss of endangered species Loss of medicinal resources Loss of native species Modification of fire regime Modification of hydrology Modification of natural benthic comunities Modification of nutrient regime Modification of successional patterns Monoculture formation Negatively impacts agriculture Negatively impacts aquaculture Negatively impacts cultural/traditional practices Negatively impacts forestry Negatively impacts human health Negatively impacts livelihoods Negatively impacts mariculture Negatively impacts tourism Obstructs waterways Other Reduced amenity values Reduced native biodiversity Selective loss of genotypes Soil accretion Threat to endangered species Threat to native species Transportation disruption Alleopathic Causes allergic response Competition-Monopolising resources Competition-Shading Competition-Smothering Competition -Strangling Competition-Other Disease transmission Filtration Fouling Herbivory/Grazing/Browsing Hybridisation Induces hypersensitivity Interaction with other invasive species Parisitism Pathogenic Poisoning Pollen swamping Predation Rapid growth Rooting Trampling e.g. high, moderate, minor or potential Economic, Social, environmental Free text Generic user defined classification scheme Metadata referring to the principal source of the entire data collection (thus the metadata scope may be wider than the objects actually contained in the data set). If a history of the data collection (revised or expanded in various projects or at different institutions) exist, this must be reflected in the IPR statements and possibly in the list of Owners. Dublin Core conformant elements describing the content of the data source queried, representation in different languages possible The description in a specified language. Only one representation should be delivered for each language. [ATTR: language] The URI of an icon/logo symbolizing the project. Keyword lists of geographical, taxonomic, etc. scopes. In the case of projects in progress, 'scope' may define the planned or intended, rather than the achieved scope (or coverage). If scope is given, the content available should be entirely within scope, because this item is for resource discovery purposes. Compare also Coverage in DC.Description (which is language-specific). (Items from Scope may be added to DC.Coverage) A collection of terms describing the geoecological scope of the source queried by means of area names (e.g. 'Worldl', 'Germany', 'Atlantic Ocean', 'Andes', 'Mountains'. A list of recommended terms should be developed. A collection of taxon names of higher rank describing the taxonomic scope of the source queried. A list of recommended terms should be developed. Number and date of current version (particularly for citing purposes) The major version number ('1' in 1.2) as defined by the content creators. An optional minor version number ('2' in 1.2) Unconstrained text specifying status + optional number, e. g., 'beta', 'alpha', 'rc/release candidate', 'internal'. If missing, release status is assumed. Source for Dublin-Core standard element Date.Issued: Citable 'publication date' of the current version (comp. RevisionData/DateCreated and DateModified for version- independent dates). This date should be missing if the current version is not yet published! Creators, Revision status, and dates of the entire data collection from which the current dataset is derived. Entities having legal possession of the data collection content. Here defined for the entire data collection, not for individual units. If an owner statement is present on the unit level, it should override this dataset-level statement. Entitiy having legal possession of the data collection content. Copyright, terms of use, license and other IPR-related statements like disclaimer or acknowledgement. Giving a copyright statement and a (if possible public) licence is highly recommended! (=DC.Rights) Needs to refer to an agent model. Not enumerated here. experts in these set of taxa (at any rank) Tropical Subtropical Temperate Boreal Polar, All Host taxon location Native in this location or not Aircraft Aquaculture stock Bait Bulk freight/cargo Clothing/footwear Consumable Container Debris associated with human activities Floating vegetation/debris Germplasm Habitat material Hides, trophies, feathers Host organism Humans Live seafood Luggage Machinery/equipment Mail Mulch, straw, baskets, sod, etc Other live animal Other vector Pet Plant or parts of plants Sailor's seachests Ship ballast water/sediment Ship bilge water Ship holds, cabins, etc. Ship structures above the water line Ship/boat Ship/boat hull fouling Shipping material Soil, sand etc. Vehicles Waste associated with human activities Water Wind Acclimatisation Societies Agriculture (includes horticulture, forestry, etc.) Aid Angling Aquaculture Aquarium trade Biological control Botanical gardens Breeding Cut flower trade Digestion/excretion Disturbance Dune stabilisation Erosion control Escape from confinement Flooding Food Forage Forestry Garden escape Garden waste disposal Harvesting fur/wool/hair Hedges Hitchhiker Horticulture Hunting Industrial purposes Intentional release Interbasin transfers Interconnected waterways Internet sales Landscape/fauna "improvement" Landscaping industry Live food trade Mariculture Medicinal use Military movements Natural disaster Nursery trade Off-site preservation Ornamental purposes Other cause People foraging People sharing resources Pet trade Propagation Racing Research Seed trade Self-propelled Smuggling Stocking Timber trade Windbreaks Worm cultivation Zoos Rememember to add de-aggregated containers Intentional Intentional, illegally by natural means Unintentional Unknown Community needs to provide advice here e.g. mechanical individuals/Organiztions/ contacts etc. Elsewhere Below ABCD elements normalized string required to contain at least 1 character (this removes the xml string anomaly, i. e. either element/attribute may be optional, but if they are required the content may not be an empty string) Text, optional Details (both free-form text) and optional URI. A concise representation of a statement, recommended to be as short as possible, but actual length is unconstrained. Optional text of unconstrained length, elaborating details of the ShortText An optional resource on the net providing details on the statement (may be used as an alternative to the long text). A sequence of statements related to Intellectual Property Rights, credit and acknoledgement. Container element for one to several statements, normally representing different language representations of the same content. Used where the IPR declaration cannot be parsed into the specific items or for forms of IPR declaration not yet covered (e.g. database rights), Container element for one to several statements, normally representing different language representations of the same content. Copyright may include the information that the data has been released to the public domain. Container element for one to several statements, normally representing different language representations of the same content. To be used if data are placed under a public license (GPL, GFDL, OpenDocument). Placing data under a public license while maintaining copyright is recommended! Container element for one to several statements, normally representing different language representations of the same content. Defines conditions under which the data may be analised, distributed or changed. "Terms of use" includes concepts like "Usage conditions" and "Specific Restrictions". Container element for one to several statements, normally representing different language representations of the same content. Disclaimer statement, e. g. concerning responsibility for data quality or legal implications. Container element for one to several statements, normally representing different language representations of the same content. A free form text acknowledging support (e. g. grant money, help, permission to reuse published material, etc.) Container element for one to several statements, normally representing different language representations of the same content. Indicates how this dataset or record should be attributed if used [OBIF 1.0] Language-specific content metadata (title, description, etc.) with *required* Language attribute added. Source for Dublin-Core standard element "Title": A short, concise title. General Note on DublinCore translation: In addition to those that can bee transformed from UBIF metadata, an additional DC.Type='dataset' should be added. Source for Dublin-Core standard element "Description": Free-form text containing a longer description of the project. Source for Dublin-Core standard element "Coverage": Free-form text describing geographic, taxonomic, or other coverage aspects of terminology or descriptions available in the current project. URL pointing to an online source related to the current project, which may or may not serve an updated version of the description data. normalized string restricted to 1..255 character length (i. e. required, may not be empty string) Language-specific simple label, using simple formatted text Label text in a specific language. Restricted to 50 characters maximum length, including blanks (recommended to be shorter!). Label abbreviations are especially important when displaying information in a tabular format. String255 (i.e. xs:string with length 1-255), extended with language attribute Name of an individual person Preferred form of personal name for display as a string. The full name with the elements in preferred sorting sequence (vCard: Sort-String). Family names, generational names, clan name, parents/grandparents personal names, etc. This (= last name in western cultures) may be compound ('Fischer von Waldheim', 'da Selva', 'Silvano Morales'). Depending on culture it is not necessarily the name of the parents nor common to the married couple and children, thus 'family name' should be avoided even though used in vCard. (vCard:N.Family) Prefix to inherited name that should be output before name, but is usually not included in sorting. Examples: 'von', 'Lord'. Compare Title for 'Prof', 'Dr.' (vCard:N.Prefix) Suffix to name that should be output after name, regardless whether it is in sorting sequence (Inherited, Given) or not. Examples: 'Jun.', 'III.'. (vCard:N.Suffix) The name given to a person as a personal name (= first or christian name in western cultures, including 'middle initials') may contain several words ('Ana Maria', 'Jerry B.'). Applicable only to persons. (vCard:N.Given + vCard:N.Middle) May differ from the first given name: second given name, nickname ('Bob' for 'Robert'), etc. (vCard:Nickname) String (i. e. xs:string with minimum length=1) extended with language attribute String (i. e. xs:string with minimum length=1) extended with language and preferred attribute xs:anyURI extended with Preferred attribute String255 (i.e. xs:string with length 1-255), extended with preferred attribute normalized string restricted to 1..50 character length to be used for abbreviations (the recommended length of abbreviations is usually much shorter, but 50 characters should be a Collection of language-specific label representations Language-specific label representation [ATTR: language] Full organization or corporate name in multiple languages (en: 'Botanical Garden of ...', de: 'Botanischer Garten von ...'). vCard:Org.OrgName). The Label mechanism also supports acronyms / abbreviations (no vCard equivalent!). If the contact contains no person definition: the unit within the organization the agent represents; else a list of the various organisational units to which a person may belong. (vCard:OrgUnit) (vCard:OrgUnit) Type of device reached by telephone number, e.g. voice, fax, voice/fax Telephone or Fax number Full number in standard international format Free text for constraints on use e.g. "weekdays only" or "home number" Contact details. Organisation to contact Person to contact Analogue to vCard:Role: Functional contact name, e.g. "Database administrator", "The Director", etc. Contact addresses (one preferred, different languages possible) Contact address as string. Telephone and fax numbers Telephone or fax number E-mail addresses E-mail address for contact URIs for person or organisation URI for person or organisation URL of a logo image. RevisionData (creators, dates, revision) for the entire project/data set or individual objects. Source for Dublin-Core standard element "Creators", i.e. Author or editor. Source for Dublin-Core standard element"Contributors": General contributors, or translators. Source for Dublin-Core standard element "Date.Created": Date/time when the intellectual content (project, term, description, etc.) was created. Applications may initially set this to the system date for new data objects, but authors must be able to change it to an earlier date if necessary. If for legacy data this is imprecisely known, it may be missing here. Earlier versions in other data formats should then be mentioned in the copyright or acknowl. statements. Source for Dublin-Core standard element Date.Modified: Date/time when the last modification of the object was made. If in online data sources the provider can not assess this, the current date/time may be substituted. For legacy data this may be set to the file date of imported data, or estimated. marine - trophic level? micro-organism fungus (Groups traditionally studied by mycologists and the ICBN) arachnid insect flatworm nematode mollusc (Aquatic and terrestrial) alga anemone coral comb jelly jellyfish crustacean starfish fish amphibian reptile bird mammal aquatic plant (Floating or submerged) palm tree shrub grass rush sedge herb vine, climber bromeliad fern succulent Pam's FreshWaterTerms: All Amphibians-Frogs Amphibians-Salamanders Fishes Mammals Reptiles-Crocodilians Reptiles-Lizards Reptiles-S Brief synopsis of species' invasiveness, significance, interesting facts. NB This is a mixed bag of elements. Species level descriptive data should be provided by SpeciesBank sources with its own descriptive schema. Morphological data should be provided through SDD elements. Again this component is acting as a cache for data best delivered elsewhere or for reflecting the actual content of profile dbs. Not sure how far GBIF SpeciesBank has got with data standards TDWG SDD container reference Below borrowed from TCS Placeholder for other schema standards used for long text @