html_entities_ascii() just appeared under Projects; this is a very fast HTML escaping function for binary data and ASCII text. We will be using it to generate NZB fragments from our new database.
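To give a rough idea of what this kind of escaping involves, here is a minimal sketch in C. It is not the real html_entities_ascii(): the name escape_html_sketch, its signature, and the choice to emit every byte outside printable ASCII as a decimal numeric reference are all assumptions made for illustration, and it makes no attempt to be fast.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper, NOT the real html_entities_ascii().
 * Escapes `len` bytes from `src` into `dst`, which has room for `cap`
 * bytes including the terminating NUL. The input is length-delimited so
 * that embedded NUL bytes in binary data are handled.
 * Returns the number of bytes written (excluding the NUL), or (size_t)-1
 * if `dst` is too small. */
static size_t escape_html_sketch(const unsigned char *src, size_t len,
                                 char *dst, size_t cap)
{
    size_t out = 0;

    assert(src != NULL && dst != NULL);

    for (size_t i = 0; i < len; i++) {
        char num[8];              /* room for "&#255;" plus NUL */
        const char *rep;
        size_t rep_len;
        unsigned char c = src[i];

        switch (c) {
        case '&':  rep = "&amp;";  break;
        case '<':  rep = "&lt;";   break;
        case '>':  rep = "&gt;";   break;
        case '"':  rep = "&quot;"; break;
        case '\'': rep = "&#39;";  break;
        default:
            if (c >= 0x20 && c < 0x7f) {
                /* Printable ASCII passes through unchanged. */
                num[0] = (char)c;
                num[1] = '\0';
            } else {
                /* Assumption: everything else becomes &#N; */
                snprintf(num, sizeof num, "&#%u;", (unsigned)c);
            }
            rep = num;
            break;
        }

        rep_len = strlen(rep);
        if (out + rep_len + 1 > cap)   /* always keep room for the NUL */
            return (size_t)-1;
        memcpy(dst + out, rep, rep_len);
        out += rep_len;
    }

    if (out + 1 > cap)
        return (size_t)-1;
    dst[out] = '\0';
    return out;
}
```

Called on the bytes of `T&C` with a large enough buffer, this writes `T&amp;C`; a fast version would more likely use a lookup table than a switch and snprintf.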
That's right, Newzbin is writing its own database for Message-IDs; this will remove about 5 billion rows from our MySQL database and put them in a form which can be accessed and stored more efficiently. We're benchmarking it at about 110,000 inserts/sec, which includes checksumming every page (for extra paranoia, considering we're using ZFS).
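For the curious, per-page checksumming can be sketched roughly as follows. Everything specific here is an assumption for illustration: the 4096-byte page size, the 4-byte checksum slot at the start of the page, the choice of CRC-32, and the names crc32_bitwise, page_seal and page_verify. None of it describes the actual on-disk format.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define SKETCH_PAGE_SIZE 4096u  /* assumed page size */
#define SKETCH_PAGE_HDR  4u     /* assumed: checksum stored in first 4 bytes */

/* Plain bit-at-a-time CRC-32 (reflected polynomial 0xEDB88320).
 * Slow but short; a real implementation would use a table or hardware. */
static uint32_t crc32_bitwise(const uint8_t *p, size_t n)
{
    uint32_t crc = 0xFFFFFFFFu;

    for (size_t i = 0; i < n; i++) {
        crc ^= p[i];
        for (int b = 0; b < 8; b++)
            crc = (crc >> 1) ^ (0xEDB88320u & (0u - (crc & 1u)));
    }
    return ~crc;
}

/* Compute the checksum over the page body and store it in the header.
 * Call this just before the page is written out. */
static void page_seal(uint8_t *page)
{
    uint32_t c = crc32_bitwise(page + SKETCH_PAGE_HDR,
                               SKETCH_PAGE_SIZE - SKETCH_PAGE_HDR);
    memcpy(page, &c, sizeof c);
}

/* Return 1 if the stored checksum matches the page body, 0 otherwise.
 * Call this after the page is read back in. */
static int page_verify(const uint8_t *page)
{
    uint32_t stored;

    memcpy(&stored, page, sizeof stored);
    return stored == crc32_bitwise(page + SKETCH_PAGE_HDR,
                                   SKETCH_PAGE_SIZE - SKETCH_PAGE_HDR);
}
```

The idea is that a page is sealed on every write and verified on every read, so a single flipped bit in the body makes page_verify return 0.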
Once this is done, MySQL can concentrate on our file and other smaller tables, and we can look to extending our retention to match the recent spate of NSPs announcing 365 days and beyond.
It's funny how C is repeatedly turning out to be useful for a website mostly driven by PHP and Ruby; Newzbin depends on quite a lot of our custom C services and libraries. Let's enumerate some of them:
Our new database is creatively named msgiddbd: Message-ID DataBase Daemon.
I've spent a couple of hours applying a bit of polish to it, and now beta.freshbsd.org is starting to look pretty nice.