FPIndexing

Fingerprint Indexing

Experimental Results

Fingerprint Indexing is a new, simple but practically efficient non-perfect term indexing technique. It supports retrieval of terms matching a query term, matched by the query, or unifiable with it, with uniform, simple algorithms.

Fingerprint Indexing can be seen as a variant of non-perfect discrimination tree indexing, or as a generalization of top-symbol hashing. It represents terms by fingerprints, constant-lengths vectors of samples of terms at (potential) positions. Fingerprints are organized in tries, and terms are stored at the leaves of the trie. Retrieval follows all compatible branches at each fork.

Fingerprint indexing is described in Fingerprint Indexing for Paramodulation and Rewriting.

•Short version published in the Proceedings of the 6th IJCAR
‣Paper (306 kb PDF)
‣Abstract
‣BibTex
•Extended version
‣Paper (340 kb PDF)
‣Abstract
‣BibTeX
•Final test runs were performed with development versions of E 1.4. All can be reproduced with E 1.4-19.
‣E 1.4-19 source (~2.4 MB, .tgz)
•Test runs were done on the University of Miami Pegasus Cluster
‣Jobs were run on 2.4 GHz 8 Core Intel Xeon machines with 16 GB of main memory each
‣8 concurrent processes were scheduled per core
‣A CPU time limit of 300 seconds and a memory limit of 512 MB were in force for each individual test run
•The tests were run on all CNF and FOF problems from TPTP-5.2.0.
•The results are stored in protocol files
‣Each protocol contains results for all TPTP problems for a given parameter setting
‣Lines starting with a # are comment lines
‣In particular, the first line gives the command line options for the test run
‣Lines starting with a problem name contain data for that particular problem
‣The important columns are described in the legend.txt file.
•The results: EFPRes.tgz (~4.2 MB)

Stephan Schulz, schulz@eprover.org