185 lines
11 KiB
Plaintext
185 lines
11 KiB
Plaintext
head 1.3;
|
|
access;
|
|
symbols;
|
|
locks; strict;
|
|
comment @# @;
|
|
|
|
|
|
1.3
|
|
date 2007.08.21.04.14.55; author BobMorris; state Exp;
|
|
branches;
|
|
next 1.2;
|
|
|
|
1.2
|
|
date 2006.03.15.03.15.24; author BobMorris; state Exp;
|
|
branches;
|
|
next 1.1;
|
|
|
|
1.1
|
|
date 2006.03.15.00.26.49; author AnnieSimpson; state Exp;
|
|
branches;
|
|
next ;
|
|
|
|
|
|
desc
|
|
@none
|
|
@
|
|
|
|
|
|
1.3
|
|
log
|
|
@none
|
|
@
|
|
text
|
|
@%META:TOPICINFO{author="BobMorris" date="1187669695" format="1.1" version="1.3"}%
|
|
%META:TOPICPARENT{name="TerminologyWorkingGroup"}%
|
|
---+++Some Sample Definitions
|
|
(by Kevin Thiele)
|
|
|
|
%TABLE{tablerules="all"}%
|
|
|*Source*| | | |K.R. Thiele||||||||||| |CBD| |Thomson 1991 (from GISP)| |GISP|
|
|
|*Term*| | | |Alien| |Invasive| |Introduced| |Native| |Adventive| |Cryptogenic| |Alien| |Invasive| |Invasive|
|
|
|Origin| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Outside polygon of interest| |Yes| |Yes| |Yes| | | |Yes| | | |Yes| |Yes| |Yes|
|
|
| | |Inside polygon of interest| | | | | | | |Yes| | | | | | | | | | |
|
|
| | |Not relevant| | | | | | | | | | | | | | | | | | |
|
|
| | |Not known| | | | | | | | | | | |Yes| | | | | | |
|
|
|Human agent in range change?| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Yes| |Yes| |Yes| | | | | |Yes| | | | | | | |Yes|
|
|
| | |No| | | | | | | | | | | | | | | | | | |
|
|
| | |Not relevant| | | | | |Yes| | | | | | | |Yes| |Yes| | |
|
|
| | |Not known| | | | | | | | | | | |Yes| | | | | | |
|
|
| | |Not applicable| | | | | |Yes| | | | | | | | | | | | |
|
|
|Impact| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Has negative impact (at least sometimes)| | | | | | | | | | | | | | | | | | |
|
|
| | |Does not have negative impact| | | | | | | | | | | | | | | | | | |
|
|
| | |Not relevant| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes|
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
|Establishment| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Established| | | |Yes| | | |Yes| | | |Yes| | | |Yes| |Yes|
|
|
| | |Not established| | | | | | | | | | | | | | | | | | |
|
|
| | |Not relevant| |Yes| | | |Yes| | | |Yes| | | |Yes| | | | |
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
|Geographic Range| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Widespread and/or spreading| | | |Yes| | | | | | | | | | | |Yes| |Yes|
|
|
| | |Not widespread, not spreading| | | | | | | | | |Yes| | | | | | | | |
|
|
| | |Not relevant| |Yes| | | |Yes| |Yes| | | |Yes| |Yes| | | | |
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
|Ecological Range| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Restricted to human-modified habitats| | | | | | | | | | | | | | | | | | |
|
|
| | |Occurs in natural habitats| | | |Yes| | | | | | | | | | | | | |Yes|
|
|
| | |Not relevant| |Yes| | | |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| | |
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
|
|
---
|
|
---++ Commentary
|
|
|
|
Main.BobMorris wrote this commentary on Kevin's scheme in email to Main.AnnieSimpson and Main.KevinThiele.
|
|
|
|
Kevin's spreadsheet is a representation of the first part of an _ontology_, namely the terminology. It's enough for humans, but machines
|
|
usually need a little more to make good use of the ontology in deciding whether two databases use of a term are comparable for various purposes.
|
|
Nevertheless, specialists could and should start with it because it is well thought out, easy to understand, and _at least_ as much as will
|
|
ultimately be needed to resolve terminological differences between databases. Note that there are in fact four ontologies illustrated here: "K.R. Thiele", "CBD" , "Thomson 1991 (from GISP)" and "GISP". The point of Kevin's scheme is to make them comparable, as much as possible.
|
|
|
|
His spreadsheet expresses "concepts" (e.g. 'invasive', 'alien', etc.), by telling the values they have for a small set of _properties_ ('Origin',
|
|
'Impact', ...). For each property, there are a small set of possible _values_ to be used in these descriptions. Kevin's hope, which is probably
|
|
very reasonable, is that with enough discussion, the discussants will agree that a small list of properties and values are expressive enough
|
|
that most data providers can express the list of concepts coded in their own databases using just those properties and values the community
|
|
agrees upon. Thus,rather than requiring the community of data providers (and consumers) to agree on a a single set of concepts, they instead
|
|
agree on a set of terms that are used to _express_ concepts. In turn, application writers can decide for themselves how to rationalize comparisons of
|
|
those concepts. I might write an application that enshrines a notion like "If two concepts agree on the majority of their property-value
|
|
pairs, the program will treat them as the same". You might write an application that is more conservative and enshrines a requirement that
|
|
two concepts must agree on 80% of their property-value pairs before considering them the same concept. Depending on the ultimate
|
|
representation of these property-value lists, more sophisticated machine reasoning is possible than just conclusions of the form "For my purposes
|
|
I can assume that CBD 'invasive' is equivalent to Thiele 'alien'+'invasive'".
|
|
|
|
I understand that Kevin believes his property-value terms are a start, not an end, for discussion among the specialists and other interested
|
|
parties, but regardless of this there are a few technical issues about how and where to represent such ontologies that the biologists shouldn't
|
|
take a position about. In particular, there are arguments about whether the property-value terms belong in the schema or in some external document, and
|
|
whether particular concept lists (the "definitions part of a database") belong in the data to be exchanged or in external documents. Those
|
|
debates should not hinder specialists from discussing Kevin's notions and initially representing them in a spreadsheet as he has. The most
|
|
important hidden technical issue is the extent to which proliferating terms also proliferates computation time unacceptably. What one desires is that if
|
|
the terminology is increased N-fold, the time that applications need to compare data from different databases increases at most N-fold. Alas, it
|
|
is easy in the ontology game for proliferation of terminology to become exponential in computation. For example, 10 times as many terms and
|
|
concepts could lead not to 10 times as much computation, but 2 to the 10th power---1024 times as much computation. So there will always be a
|
|
tension between adequacy of expression and small size---the two central disiderata of ontologies. There is a little bit of mathematical ontology
|
|
theory that allows one to tell whether you are in this exponential situation, but I have the impression that for the few interesting
|
|
ontologies coded in other ecology projects, the situation is, sadly, exponential and the advice is to try to limit terminology size if possible.
|
|
|
|
The good news about all this is that it is a well explored and very current topic (it is presently high on the radar of the TDWG
|
|
Architecture Group (TAG), and also of the GBIF Data Access and Data Integration (DADI) group, which are debating various standards to adopt
|
|
for ontology. The bad news is that ontology tools are very sophisticated, but not very suitable for other than those who make a
|
|
living crafting ontologies. The challenge to the GISIN technical group will be to make a representation that is nimble enough to get somethng
|
|
going in a short timeframe, yet limber enough to slip out and replace if TDWG decides on standards for ontologies. I'm pretty sure this is
|
|
possible if we find that some large fraction of the target databases can get along with ontologies that are, oh, say, significantly smaller than
|
|
the databases themselves, and significantly fewer in number than the number of databases!
|
|
|
|
In summary, my opinion is that initially, people should not be debating the concepts ("Invasive", "alien", ...) yet , but rather whether the primitive properties and values Kevin provides are powerful enough to express most of the concepts represented by the databases whose data would form part of GISIN.
|
|
|
|
An interesting document for that discussion might also be the [[http://www.mnla.com/pdf/invasive/MIPAG_final_050325_rev.pdf][Massachusetts Criteria for Evaluating Non-Native Plant Species for Invasiveness]] pointed out by Main.JenniferForman, who will shortly post some other things that comprise what amount to ontologies.
|
|
|
|
-- Main.BobMorris - 15 Mar 2006@
|
|
|
|
|
|
1.2
|
|
log
|
|
@none
|
|
@
|
|
text
|
|
@d1 1
|
|
a1 1
|
|
%META:TOPICINFO{author="BobMorris" date="1142392524" format="1.0" version="1.2"}%
|
|
d6 34
|
|
a39 34
|
|
|
|
|*Source*| | | |K.R. Thiele||||||||||| |CBD| |Thomson 1991 (from GISP)| |GISP|
|
|
|*Term*| | | |Alien| |Invasive| |Introduced| |Native| |Adventive| |Cryptogenic| |Alien| |Invasive| |Invasive|
|
|
|Origin| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Outside polygon of interest| |Yes| |Yes| |Yes| | | |Yes| | | |Yes| |Yes| |Yes|
|
|
| | |Inside polygon of interest| | | | | | | |Yes| | | | | | | | | | |
|
|
| | |Not relevant| | | | | | | | | | | | | | | | | | |
|
|
| | |Not known| | | | | | | | | | | |Yes| | | | | | |
|
|
|Human agent in range change?| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Yes| |Yes| |Yes| | | | | |Yes| | | | | | | |Yes|
|
|
| | |No| | | | | | | | | | | | | | | | | | |
|
|
| | |Not relevant| | | | | |Yes| | | | | | | |Yes| |Yes| | |
|
|
| | |Not known| | | | | | | | | | | |Yes| | | | | | |
|
|
| | |Not applicable| | | | | |Yes| | | | | | | | | | | | |
|
|
|Impact| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Has negative impact (at least sometimes)| | | | | | | | | | | | | | | | | | |
|
|
| | |Does not have negative impact| | | | | | | | | | | | | | | | | | |
|
|
| | |Not relevant| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| |Yes|
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
|Establishment| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Established| | | |Yes| | | |Yes| | | |Yes| | | |Yes| |Yes|
|
|
| | |Not established| | | | | | | | | | | | | | | | | | |
|
|
| | |Not relevant| |Yes| | | |Yes| | | |Yes| | | |Yes| | | | |
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
|Geographic Range| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Widespread and/or spreading| | | |Yes| | | | | | | | | | | |Yes| |Yes|
|
|
| | |Not widespread, not spreading| | | | | | | | | |Yes| | | | | | | | |
|
|
| | |Not relevant| |Yes| | | |Yes| |Yes| | | |Yes| |Yes| | | | |
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
|Ecological Range| | | | | | | | | | | | | | | | | | | | |
|
|
| | |Restricted to human-modified habitats| | | | | | | | | | | | | | | | | | |
|
|
| | |Occurs in natural habitats| | | |Yes| | | | | | | | | | | | | |Yes|
|
|
| | |Not relevant| |Yes| | | |Yes| |Yes| |Yes| |Yes| |Yes| |Yes| | |
|
|
| | |Not known| | | | | | | | | | | | | | | | | | |
|
|
d88 1
|
|
a88 2
|
|
-- Main.BobMorris - 15 Mar 2006
|
|
|
|
@
|
|
|
|
|
|
1.1
|
|
log
|
|
@none
|
|
@
|
|
text
|
|
@d1 1
|
|
a1 1
|
|
%META:TOPICINFO{author="AnnieSimpson" date="1142382409" format="1.0" version="1.1"}%
|
|
d41 49
|
|
@
|