tag/databasedmo.cahttp://www.dmo.ca//tag/database/dmo.caikiwiki2009-05-16T01:45:14ZBenchmarking BDB, CDB and Tokyo Cabinet on large datasetshttp://www.dmo.ca//blog/benchmarking-hash-databases-on-large-data/2009-05-16T01:45:14Z2009-05-16T01:45:14Z
<p>At <a href="http://www.roaringpenguin.com/">my job</a> we have need of a high-performance
hash lookup database in our <a href="http://www.roaringpenguin.com/products/canit-pro">antispam product</a>.
It's used to store Bayes tokens for quick lookups on individual scanning
systems, and is read-only in the fast path (mail scanning) with updates taking
place in another process. For the last few years, we've been using a plain old
BerkeleyDB hash database via Perl's DB_File, but with all the hype about <a href="http://tokyocabinet.sourceforge.net/">Tokyo
Cabinet</a> and its <a href="http://tokyocabinet.sourceforge.net/benchmark.pdf">benchmark
results</a> I figured it was
time to take a look.</p>
<p><a href="http://www.dmo.ca//tag/database/../../blog/benchmarking-hash-databases-on-large-data/#more">more</a></p>