Ramblings: XQuery parser performance

Monday, 13 February 2012

XQuery parser performance

This post was updated on 15th Feb with the BaseX 7.1.1 results

A comparison of XQuery engine performance running the XQuery parser from xquerydoc project. The test parses the XQuery program string "2+3":

import module namespace p="XQueryV30" at "XQueryV30.xq";
p:parse-XQuery("2+3")

Engines

Zorba XQuery Engine, Version: 2.1.0
BaseX 7.1.1 Beta [Standalone]
Saxon-HE 9.4.0.2J
MXQuery 0.6.0

java version "1.6.0_26"
Java(TM) SE Runtime Environment (build 1.6.0_26-b03)
Java HotSpot(TM) Client VM (build 20.1-b02, mixed mode, sharing)

Running on Ubuntu 11.04 on a Thinkpad T42.

Zorba

http://www.zorba-xquery.com

time zorba -f -q test_xqparser.xq 
real 0m24.562s 
user 0m21.489s 
sys  0m0.240s

BaseX

http://basex.org

Results for version 7.1.1 (BaseX711-20120215.234615)


time basex test_xqparser.xq
real 0m1.601s
user 0m1.260s
sys 0m0.088s

Results for version 7.1.0

time basex test_xqparser.xq
real 96m29.589s
user 54m35.961s
sys 0m19.533s

Saxon

Installed using installing-saxon-he-ubuntu.html

time saxon-xq test_xqparser.xq 
real 0m2.673s
user 0m2.372s
sys  0m0.140s

MXQuery

http://mxquery.org/

java -Xms1024m -Xmx1024m  -jar mxquery.jar -f test_xqparser.xq
MXQuery 0.6.0
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space

All scripts are run from the xqparserperf/src directory.

Github: xqparserperf

(import-existing-source-code-to-github)

8 comments:

Adam Retter16 February 2012 at 21:32
Andy, you might like to test with eXist-db using the Java Admin Client running in embedded mode. My measurements, probably on a less powerful machine (VM) are 508 seconds for compilation of the query and 283 ms for execution.
ReplyDelete
Replies
Jim Fuller17 February 2012 at 09:39
on MarkLogic 5.0 I get

4063 Expressions

PT0.01154S
ReplyDelete
Replies
Content Mangler17 February 2012 at 12:22
Recently, I contributed some optimizations to performance to REx that improved performance by at least 50% on MarkLogic. The key improvement was using fn:subsequence instead of predicate range expressions. I would re-run the ebnf through rex to get new statistics. Thanks Gunther for his attention.
ReplyDelete
Replies

Add comment