wiki-archive/twiki/data/Phylogenetics/WorkingMeeting2010.txt,v

533 lines
20 KiB
Plaintext

head 1.17;
access;
symbols;
locks; strict;
comment @# @;
1.17
date 2010.10.03.16.22.45; author NicoCellinese; state Exp;
branches;
next 1.16;
1.16
date 2010.10.01.20.06.45; author HilmarLapp; state Exp;
branches;
next 1.15;
1.15
date 2010.10.01.18.09.40; author HilmarLapp; state Exp;
branches;
next 1.14;
1.14
date 2010.10.01.16.09.20; author HilmarLapp; state Exp;
branches;
next 1.13;
1.13
date 2010.10.01.12.51.38; author ArlinStoltzfus; state Exp;
branches;
next 1.12;
1.12
date 2010.09.30.05.11.52; author ChristopherGoddard; state Exp;
branches;
next 1.11;
1.11
date 2010.09.30.03.01.14; author TorstenEriksson; state Exp;
branches;
next 1.10;
1.10
date 2010.09.29.21.59.30; author ArlinStoltzfus; state Exp;
branches;
next 1.9;
1.9
date 2010.09.29.21.19.01; author DanRosauer; state Exp;
branches;
next 1.8;
1.8
date 2010.09.29.21.16.20; author ArlinStoltzfus; state Exp;
branches;
next 1.7;
1.7
date 2010.09.29.15.53.09; author NicoCellinese; state Exp;
branches;
next 1.6;
1.6
date 2010.09.29.14.44.02; author HilmarLapp; state Exp;
branches;
next 1.5;
1.5
date 2010.09.29.14.12.07; author ArlinStoltzfus; state Exp;
branches;
next 1.4;
1.4
date 2010.09.29.13.17.30; author HilmarLapp; state Exp;
branches;
next 1.3;
1.3
date 2010.09.28.16.07.49; author ArlinStoltzfus; state Exp;
branches;
next 1.2;
1.2
date 2010.09.28.13.16.14; author NicoCellinese; state Exp;
branches;
next 1.1;
1.1
date 2010.09.27.22.01.33; author HilmarLapp; state Exp;
branches;
next ;
desc
@none
@
1.17
log
@none
@
text
@%META:TOPICINFO{author="NicoCellinese" date="1286122965" format="1.1" version="1.17"}%
%META:TOPICPARENT{name="WebHome"}%
---+ Working Meeting at TDWG 2010
We held a full-day hands-on working meeting at the 2010 TDWG Conference in Woods Hole, MA, on Wednesday, Sep 29. The meeting took place in room MRC #310 (in the [[http://www.tdwg.org/fileadmin/2010conference/graphics/WHMapv2.pdf][building marked with 8 in the MBL building map]]).
We started with short introductions around the room, and then decided on development targets according to participants' interests.
---++ Participants
<div style="float: right; width: 305px; margin-left: 5px;"><img src="%ATTACHURLPATH%/TDWG_Phylo_2010-1.jpg" alt="TDWG_Phylo_2010-1.jpg" width="300px" />
<span style="font-size: x-small;">Elena Herzog, Dan Rosauer, Kate Rachwal, and Chris Goddard are unfortunately missing from this photo.</span>
</div>
| *Name* | *Institution* |
| Nico Cellinese | Florida Museum of Natural History, University of Florida |
| Torsten Eriksson | Swedish Museum of Natural History, Stockholm |
| Chris Goddard | Florida Museum of Natural History, University of Florida |
| Elena Herzog | Bik-F - Biodiversity and Climate Research Center |
| Hilmar Lapp | NESCent |
| Kate Rachwal | Florida Museum of Natural History, University of Florida |
| Dan Rosauer | Yale University |
| Arlin Stoltzfus | NIST, University of Maryland |
---++ Development targets
The [[https://docs.google.com/document/pub?id=1ihH6MGKBCpH10sqvDK5ahLgvz6P_qP-TaN-vLt6O9j8][meeting abstract]], which incorporated the input obtained from interest group members through the mailing list, suggested several targets. These were further narrowed down at the meeting to the list below.
1. LinkingTrees2010 (Dan, Jamie, Arlin, Torsten, Elena)
* Work out *best practices for publishing a tree electronically*, via an archive such as [[http://www.treebase.org TreeBase]] or [[http://datadryad.org Dryad]], or via the semantic web (presumably using [[http://www.evolutionaryontology.org/cdao CDAO]]). At minimum, get some actual trees (+/- matrices) ready to submit. At best, draft a how-to guide for publication.
* Make it easy for users to put species name (or other taxonomic identifier), specimen or source identifier, and geographic coordinates into their trees.
1. Further develop the [[http://www.evoio.org/wiki/Phyloreferencing_subgroup][Phyloreferencing standard]]. _Hilmar Lapp_, _Nico Cellinese_
* Compare with Mike Keesey's [[http://namesonnodes.org/][Names on Nodes]].
* [[https://spreadsheets.google.com/pub?key=0AmUeHhtnDeCZdGRFMHQ5U2RVSG9CNFBRanpKZlJHYVE&hl=en&output=html][Phyloreferencing vocabulary draft]] started at the 2009 !VoCamp.
* [[http://sourceforge.net/apps/mediawiki/treebase/index.php?title=API][TreeBASE API]] and the full [[https://spreadsheets.google.com/pub?key=0Av8UW3JvZsgcckwtLU83cHloUjhGY25uRzUtb2ZBbHc&hl=en&single=true&gid=0&output=html][Metadata query vocabulary used by TreeBASE]]
1. Increase the support for the emerging data standards (e.g., !NeXML) among tree visualization tools (Kate)
* For example, [[http://jsphylosg.com][jsPhyloSVG]] supports !phyloXML, but not !NeXML. Same with [[http://code.google.com/p/phylobox][Phylobox]].
* Or, write an XSLT to translate [[http://www.phyloxml.org][phyloXML]] to [[http://www.nexml.org][NeXML]], and vice versa. The good news is that XML is emerging as a standard for serializing trees and associated data. The bad news is that there are 2 different schemas.
1. Ruby-on-Rails / Web-application version of Phenex functionality (Chris)
---++ Outcomes
---+++ LinkingTrees2010
_Dan, Jamie, Arlin, Torsten, Elena_
Best practices for publishing a tree so that it is reusable and can be linked to other data.
Outcomes
* Identified key features that maximize potential for reuse or integration (accn, lsid or other taxo ref, geo coord)
* identified resources available (file formats, data archives, tools)
* !TreeBASE supports archiving with capacity to add key features (NCBI or UBio name matching)
* Tested with reasonable success using files from Mesquite and Bio::NEXUS
* !TreeBASE steps one through the process.
* found out how !NeXML and !PhyloXML can represent key features
* (partly) created set of matching test files in different formats
* Planned follow-up (Dan, Arlin)
* Report for TDWG
* request comments from a larger group of stakeholders
* possible manuscript for publication.
---+++ Further develop the [[http://www.evoio.org/wiki/Phyloreferencing_subgroup][Phyloreferencing standard]]
_Hilmar, Nico_
Outcomes
1. Trying to understand what [[http://namesonnodes.org][Names on Nodes]] (Keesey) might contribute
* Relative to previous effort, doesn't add biologically significant use cases (except perhaps to remind us that crown and stem clade modifiers are not yet included in the phyloreference specification).
* The tool has a very nice UI, and the code (Macromedia Flash) could possibly be modified to accept phyloreference expressions, in addition to !MathML, which it was written for. (It does apparently already support !NeXML.)
1. [[https://spreadsheets.google.com/pub?key=0AmUeHhtnDeCZdGRFMHQ5U2RVSG9CNFBRanpKZlJHYVE&hl=en&output=html][Standard vocabulary]] for pinning down the semantics of specifiers in phyloreferences (such as denoting taxon identifier, scientific name, specimen identifier, sequence accession numbers, etc. This was left hanging from the !VoCamp.
* Determined that this significantly overlaps with, and hence should be merged into the !PhyloWS query predicate vocabulary, for which !TreeBASE currently publishes the de-facto draft (see [[http://treebase.svn.sourceforge.net/viewvc/treebase/trunk/treebase-core/src/main/resources/treebase.owl][treebase-core/src/main/resources/treebase.owl]] in the !TreeBASE codebase).
* *Follow-up step:* Need to create a draft master query predicate vocabulary as part of the !PhyloWS standard specification, then merge in terms needed for phyloreferencing but currently lacking from the vocabulary.
* *Follow-up step:* Need to initiate a discussion whether the !PhyloWS query predicate should stand on its own, or should in the long run better be merged into Darwin Core or CDAO.
1. Taxonomy broker: This was tabled at the 2009 !VoCamp. A taxonomy broker returns species names (tip labels) in exchange for higher-level taxon names, !PhyloCode names, type specimen identifiers, and other "canonical" metadata that not every phylogenetic data provider might have for every tree.
* Determined that this is a key piece of infrastructure, and a current gap. Without it, some of the most intuitive use-cases will not work, such as querying for all trees containing _some_ "Aves" when most trees won't have internal node labels.
* We thought that a limited, yet still useful implementation could be scoped such that it is feasible as a Summer of Code project, so that the benefits could be demonstrated. The implementation would work on top of an existing taxonomy or nomenclatural authority, such as the NCBI taxonomy, and !RegNum.
* Based on a conversation with Rich Pyle, it also appears that GNA (Global Names Architecture) may have many of the necessary pieces and services.
* *Follow-up steps:* Check the GNA codebase for relevant pieces (once published?). Write Summer of Code project idea (Feb 2011).
---+++ !PhyloXML to !NeXML interconversion
_Kate_
Using XSLT to convert between two emerging XML standards for phylogenetics:
* Started with a stylesheet that takes as input a !phyloXML document and converts to !NeXML.
* Planned follow-up:
* Kate commits for another week
* expects to finish XSLT that converts at least the trees !phyloXML to !NeXML.
* finishing the job might be doable within a month.
---+++ Web interface to phenex functionality
_Chris_
Web-based ontology annotation of phylogenetic data: porting Phenex to the web
* Preliminary work is on !GitHub: [[http://github.com/c1sgo/rjs_phenex]]
* Started with the basic framework classes and windows.
* Might take only around 2-3 weeks to make an initially functional version.
* Chris is committed to at least a few weeks more work on this
%META:FILEATTACHMENT{name="TDWG_Phylo_2010-1.jpg" attachment="TDWG_Phylo_2010-1.jpg" attr="" comment="" date="1285949256" path="TDWG Phylo 2010-1.jpg" size="1257206" stream="TDWG Phylo 2010-1.jpg" user="Main.HilmarLapp" version="1"}%
@
1.16
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="HilmarLapp" date="1285963605" format="1.1" reprev="1.16" version="1.16"}%
d12 1
a12 1
<span style="font-size: x-small;">Elena Herzog, Kate Rachwal, and Chris Goddard are unfortunately missing from this photo.</span>
@
1.15
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="HilmarLapp" date="1285956580" format="1.1" reprev="1.15" version="1.15"}%
d42 1
a42 1
---+++ LinkingTrees2010
d44 1
a44 1
Dan, Jamie, Arlin, Torsten, Elena
d62 1
a62 2
Hilmar, Nico
d66 14
a79 9
* implementation could be adapted to compute phyloreferences via !phyloWS
* relative to previous effort, doesn't add biologically significant use case
1. Left hanging from !VoCamp
* vocabulary that can be used to pin down semantics of specifiers in phyloreferences
* taxonomy broker (allows people to query resources)
* e.g., query "aves" in tb even thhough tb entries don't have "aves" labels on nodes
* query authority with voucher specimen
---+++ !PhyloXML to !NeXML interconversion (Kate)
d88 2
a89 1
---+++ Web interface to phenex functionality (Chris)
@
1.14
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="HilmarLapp" date="1285949360" format="1.1" reprev="1.14" version="1.14"}%
d5 1
a5 1
We are holding a full-day hands-on working meeting at the 2010 TDWG Conference in Woods Hole, MA, on Wednesday, Sep 29. The meeting is currently scheduled for room MRC #310 (in the [[http://www.tdwg.org/fileadmin/2010conference/graphics/WHMapv2.pdf][building marked with 8 in the MBL building map]]).
d7 1
a7 1
The meeting is open to all. We will start with short introductions around the room for any participants that are new, and then decide on development targets according to participants' interests. We do intend to break into working subgroups or teams relatively quickly, and so we are currently not planning on overview or other introductory presentations. (However, a subgroup may choose to first update itself on research and tool resources that pertain to their chosen target.)
d11 3
a23 2
<img src="%ATTACHURLPATH%/TDWG_Phylo_2010-1.jpg" alt="TDWG_Phylo_2010-1.jpg" width="300px" />
d26 1
a26 1
The [[https://docs.google.com/document/pub?id=1ihH6MGKBCpH10sqvDK5ahLgvz6P_qP-TaN-vLt6O9j8][abstract of the meeting]] (generated from input solicited from interest group members through the mailing list) suggests several targets. The list below is tentative at this stage. Please add either more specifics to a listed target, or a completely new target. Also, add your name to those targets that you are most interested in working on. (Putting your name down is an expression of interest, not a commitment yet. Hence, you can put your name down for more than one target.) Also, there is the mailing list (see Gettting Involved) for discussion prior to Wed, or post on Twitter using the #TDWG hashtag.
d33 1
a33 1
* [[https://spreadsheets.google.com/pub?key=0AmUeHhtnDeCZdGRFMHQ5U2RVSG9CNFBRanpKZlJHYVE&hl=en&output=html][Phyloreferencing vocabulary draft]] started at the 2009 VoCamp.
d35 4
a38 4
1. Increase the support for the emerging data standards (e.g., NeXML) among tree visualization tools (Kate)
* For example, [[http://jsphylosg.com][jsPhyloSVG]] supports phyloXML, but not NeXML. Same with [[http://code.google.com/p/phylobox][Phylobox]].
* Or, write an XSLT to translate [[http://www.phyloxml.org phyloXML]] to [[http://www.nexml.org NeXML]], and vice versa. The good news is that XML is emerging as a standard for serializing trees and associated data. The bad news is that there are 2 different schemas.
1. web interface to phenex functionality (Chris)
d51 1
a51 1
* TreeBASE supports archiving with capacity to add key features (NCBI or UBio name matching)
d53 2
a54 2
* TreeBASE steps one through the process.
* found out how NeXML and PhyloXML can represent key features
d66 2
a67 2
* first: trying to understand what Names on Nodes (Keesey) might contribute
* implementation could be adapted to compute phyloreferences via phyloWS
d69 1
a69 1
* second: left hanging from vocamp
d75 1
a75 1
---+++ PhyloXML to NeXML interconversion (Kate)
d78 1
a78 1
* Started with a stylesheet that takes as input a phyloXML document and converts to NeXML.
d81 1
a81 1
* expects to finish XSLT that converts at least the trees phyloXML to NeXML.
d87 1
a87 1
* Preliminary work is on GitHub: [[http://github.com/c1sgo/rjs_phenex]]
@
1.13
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="ArlinStoltzfus" date="1285937498" format="1.1" reprev="1.13" version="1.13"}%
d21 2
d90 2
@
1.12
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="ChristopherGoddard" date="1285823512" format="1.1" version="1.12"}%
d9 12
a35 1
* Preliminary work is on GitHub: [[http://github.com/c1sgo/rjs_phenex]]
d37 7
a43 1
---++ Participants
d45 43
a87 9
| *Name* | *Institution* |
| Nico Cellinese | Florida Museum of Natural History, University of Florida |
| Hilmar Lapp | NESCent |
| Arlin Stoltzfus | NIST, University of Maryland |
| Dan Rosauer | Yale University |
| Chris Goddard | Florida Museum of Natural History, University of Florida |
| Elena Herzog | Bik-F - Biodiversity and Climate Research Center |
| Kate Rachwal | Florida Museum of Natural History, University of Florida |
| Torsten Eriksson | Swedish Museum of Natural History, Stockholm |@
1.11
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="TorstenEriksson" date="1285815674" format="1.1" reprev="1.11" version="1.11"}%
d24 1
d36 1
a36 1
| Torsten Eriksson | Swedish Museum of Natural History, Stockholm |
@
1.10
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="ArlinStoltzfus" date="1285797570" format="1.1" reprev="1.10" version="1.10"}%
d13 1
a13 1
1. LinkingTrees2010 (Dan, Jamie, Arlin, Thorsten, Elena)
d35 1
@
1.9
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="DanRosauer" date="1285795141" format="1.1" version="1.9"}%
d13 1
a13 1
1. LinkingTrees2010:
d15 1
a15 1
* Make it easy for users to put species name (or other taxonomic identifier), specimen or source identifier, and geographic coordinates into their trees.
d20 1
a20 1
1. Increase the support for the emerging data standards (e.g., NeXML) among tree visualization tools.
d22 3
a24 2
* Or, write an XSLT to translate [[http://www.phyloxml.org phyloXML]] to [[http://www.nexml.org NeXML]], and vice versa. The good news is that XML is emerging as a standard for serializing trees and associated data. The bad news is that there are 2 different schemas.
d34 1
a34 1
| Kate Rachwal | Florida Museum of Natural History, University of Florida |@
1.8
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="ArlinStoltzfus" date="1285794980" format="1.1" version="1.8"}%
d30 1
a30 1
| Dan Rosauer | Yale |
@
1.7
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="NicoCellinese" date="1285775589" format="1.1" version="1.7"}%
d14 2
a15 2
* Make it easy for users to put species name (or other taxonomic identifier), specimen or source identifier, and geographic coordinates into their trees. _Arlin Stoltzfus_, _Dan Rosauer_
* Work out *best practices for publishing a tree electronically*, via an archive such as [[http://www.treebase.org TreeBase]] or [[http://datadryad.org Dryad]], or via the semantic web (presumably using [[http://www.evolutionaryontology.org/cdao CDAO]]). At minimum, get some actual trees (+/- matrices) ready to submit. At best, draft a how-to guide for publication. _Arlin Stoltzfus_
a22 2
1. Expose the taxonomic data in online biodiversity data resources (such as EOL and Scratchpads) through a standard data access interface (e.g., PhyloWS).
* One of the possible targets here might be to create an application that uses the EOL API to expose its data according to the PhyloWS standard. This would probably use mostly the classification API to EOL.
@
1.6
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="HilmarLapp" date="1285771442" format="1.1" reprev="1.6" version="1.6"}%
d35 1
@
1.5
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="ArlinStoltzfus" date="1285769527" format="1.1" reprev="1.5" version="1.5"}%
d16 4
a19 1
1. Further develop the [[http://www.evoio.org/wiki/Phyloreferencing_subgroup][Phyloreferencing standard]]. (compare with Mike Keesey's [[http://namesonnodes.org/ Names on Nodes]]. _Hilmar Lapp_, _Nico Cellinese_
d26 1
a26 1
---++ Participants
@
1.4
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="HilmarLapp" date="1285766250" format="1.1" version="1.4"}%
d13 4
a16 2
1. LinkingTrees2010: Make it easy for users to put species name (or other taxonomic identifier), specimen or source identifier, and geographic coordinates into their trees. _Arlin Stoltzfus_, _Dan Rosauer_
1. Further develop the [[http://www.evoio.org/wiki/Phyloreferencing_subgroup][Phyloreferencing standard]]. _Hilmar Lapp_, _Nico Cellinese_
d19 1
d22 2
a23 4
1. Work out *best practices for publishing a tree electronically*, via an archive such as [[http://www.treebase.org TreeBase]] or [[http://datadryad.org Dryad]], or via the semantic web (presumably using [[http://www.evolutionaryontology.org/cdao CDAO]]). At minimum, get some actual trees (+/- matrices) ready to submit. At best, draft a how-to guide for publication. _Arlin Stoltzfus_
1. Write an XSLT to translate [[http://www.phyloxml.org phyloXML]] to [[http://www.nexml.org NeXML]], and vice versa. The good news is that XML is emerging as a standard for serializing trees and associated data. The bad news is that there are 2 different schemas.
1. Develop a plan to integrate [[http://search.cpan.org/dist/Bio-NEXUS Bio::NEXUS]] with BioPerl or Bio::Phylo. Bio::NEXUS is a Perl API for NEXUS files with a sophisticated parser and a toolbox full of goodies. But the NEXUS IO is a limiting factor in its use. _Arlin Stoltzfus_
---++ Participants
d31 1
a31 1
| Elena Herzog | Bik-F - Biodiversity and Climate Research Center |@
1.3
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="ArlinStoltzfus" date="1285690069" format="1.1" reprev="1.3" version="1.3"}%
a28 1
| Bill Piel | Yale |
d30 1
@
1.2
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="NicoCellinese" date="1285679774" format="1.1" version="1.2"}%
d19 3
a21 1
d30 1
a30 1
| Chris Goddard | Florida Museum of Natural History, University of Florida |@
1.1
log
@none
@
text
@d1 1
a1 1
%META:TOPICINFO{author="HilmarLapp" date="1285624893" format="1.1" reprev="1.1" version="1.1"}%
d28 1
@