wiki-archive/twiki/data/DarwinCore/Country.txt

70 lines
5.2 KiB
Plaintext

%META:TOPICINFO{author="StephenLong" date="1161629749" format="1.1" version="1.7"}%
%META:TOPICPARENT{name="DarwinCoreDraftStandard"}%
---++Element Description: Country
The full, unabbreviated name of the country or major political unit in which the organism was collected or observed.
---++++ Open Questions:
See Open Discussion at: [[http://circa.gbif.net/Public/irc/tdwg/dwcrev/newsgroups?n=dwcrev][GBIF Circa Registries, etc.]]
• 16 Oct 2004 - Recommendation that the Country element be populated with ISO 2-letter country code. (Ginzbarg, Kirk GBIF Circa).
---+++ Comments
Use the space below to make comments about this page. -- Main.StephenLong - 24 Aug 2006
------
%ICON{bubble}% *[HERBARIA] Databasing "County"*
Posted by: Steven Ginzbarg [mailto:sginzbar@biology.as.ua.edu] Date: Sat, 16 Oct 2004, 20:26:26
If Country will be used for selecting records we should try to standardize the names. The current version of !DarwinCore2 which GBIF is now using has this description for Country: "The country or major political unit from which the specimen was collected. http://www.din.de/gremien/nas/nabd/iso3166ma/codlstp1/db_en.html
ISO 3166-1 values should be used. Full country names are currently in use. A future recommendation is to use ISO3166-1 two letter codes or the full name when searching."
However the !DarwinCore2 being proposed by Stan Blum as a TDWG draft standard does not mention using codes in the country description:
"The full, unabbreviated name of the country or major political unit from which the organism was collected."
I think some will interpret this as "United States" and others as "United States of America" unless a standard (ISO) is specified. ISO provides short country names and full country names, 2 letter codes and 3 letter codes, e.g. Alpha-2: ZA Alpha-3: ZAF English short name: SOUTH AFRICA English full name: Republic of South Africa French short name: AFRIQUE DU SUD French full name: République sud-africaine The short country names from ISO 3166-1 and the alpha-2 codes are made available by ISO for download at no charge at http://www.iso.org/iso/en/prods-services/iso3166ma/02iso-3166-code-lists/index.html.
However if full ISO country names will be required for !DwC, we will need to pay for them. The ISO3166-1 cost 100 euros or about $125 from the Deutsches Institut für Normlung (DIN) http://www2.din.de or http://www2.din.de/index.php?lang=en (English). Search for ISO3166. The full ISO3166-1 data can be ordered from ISO as an Access 2000 database for 150 CHF (Swiss Francs) or about $122.
ISO country names are available in English or French but not in the language of the country. If ISO full country names are the standard, the language would need to be specified. Country codes should be more stable than names, and short names more stable than full names when regimes change.
Steven Hill wrote: "I no longer use abbreviations. Those were important when limited space was available on hard drives, but I do not believe it to be useful now, and it is simply too much to try and remember so many acronyms and codes when one can search for a full name these days."
We have a table that converts the countries in our geographic authority file to the 2-letter ISO codes if possible. Users of our database are unaware that countries are being sent to GBIF as codes. Ideally, people querying the GBIF data portal should see the country names even though the code was what was provided. From the GBIF home page http://www.gbif.org/.
I can go to Browse, click on Country and pick a country from the list. (What standard is GBIF using for these country names?) In the advanced search the country has to be typed in to a text box.
>From: "lapham" To: Date sent: Fri, 15 Oct 2004 23:33:07 -0500 Subject: [HERBARIA] Databasing "County"
> We need metadata standards so we all know what the data means. The
> metadata data I've seen for country, is pretty vague but NBII said the
> intent was the ISO standard. If the users decide the meta data
> standards individually as seems to be the case at present, you can get
> quite a lot of variation and your queries don't prodeuce the results
> you are expecting.
> I have seen all of the following used for our country:
> USA
> USA.
> U.S.A.
> United States
> United States of America.
> US
> US.
> U.S.
> They all work for human biengs, but all are different to a computer.
> If you are expecting one of these, because that is what you use, there
> is no guarantee that someone else who is sharing data is using the
> same nomenclature. We shouldn't have to fiddle with stuff like this.
> Inicdedtally the fourth third one is the ISO nomenclature.
> Charlie Lapham
------
<DL><DD>%ICON{bubble}% *Re: [HERBARIA] Databasing County*
Posted by: Paul Kirk [mailto:p.kirk@cabi.org] Date: Sat, 16 Oct 2004, 21:34:57
The TDWG 'Geographical Recording System' provides a means of recording distribution (on land masses) via a 4 level hierarchy. One or more elements at the lowest level can be combined to give ISO equivalents (e.g. Sabah, Sarawak & Peninsular Malaysia give ISO Malaysia). This sytem is not a substitute for the original locality data but a means of standardizing to compare like with like in the absence of resources to do full georeferencing.
Paul M. Kirk;
(CABI Bioscience)
</DL>
------
------
%ICON{bubble}%