Will require some basic theora support (beyond existing detection), then a dedicated Tika parser for it Should ideally provide information on both the theora video, and the main audio stream