Hello David,
Thanks for your interest in MonetDB/XQuery.
First of all, I am not aware of success stories about querying GATE
annotations, but then again, I am also not aware of any attempts to
query GATE annotations, and I think I probably would have known if
there were any.
The stand-off axes supported in MonetDB/XQuery are fairly generic in
the way that they allow for querying over a separate linear dimension
(whether they represent timestamps, positions in text, or
byte-offsets). The stand-off extensions that MonetDB/XQuery provides
merely provide convenient access to query for overlap and inclusion
for ranges in this dimension (/select-wide::* and /select-narrow::*).
Both operators can be easily written down in plain XQuery (would look
something like: for $other in $candidates for $item in $source where
$item/@start > $other/@start and $item/@end < $other/@end return
$item).
The benefit of the stand-off extensions in MonetDB/XQuery is two-fold:
convenient notation, and optimized processing (special indices are
created such that querying GB's of stand-off XML can be done in
interactive time). The current implementation only recognizes the
notation in which a stand-off range of an XML-element is expressed
with two XML-attributes with numeric values indicating start and end
of the range. So in your case you probably would need to convert the
GATE annotations to this format. By the way: the names of the
XML-attributes can be configured in MonetDB.conf.
I hope this answers your question, and I hope you can determine
whether MonetDB/XQuery is suited for your needs. Please let me know if
you have more questions. I would also be keen to hear about and/or
help out with querying GATE using the stand-off extensions.
Greetings,
Wouter
2010/2/23 David Epstein
Hello,
I am looking for information and experiences with monetdb's standoff annotation extension for Xquery.
In particular, I'm looking for a way to more flexibly and conveniently search through documents I've manually annotated for semantic content using GATE (General Architecture for Text Engineering), which has its own standoff format. Has anyone had success in querying GATE documents using monetdb xquery with the standoff extension?
If not GATE, what are people using to produce the standoff notation?
thank you, -david
David Epstein PhD Candidate Urban & Regional Planning University of Michigan USA
------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ MonetDB-users mailing list MonetDB-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/monetdb-users