Beagle supports the following data sources:
- File system
- Locate backend (optional), which returns results from the locate tool
- Evolution mail, calendar, and addressbook
- Thunderbird mail, news, RSS feeds, and addressbook
- Korganizer events and TODOs
- KMail mail
- KAddressbook addressbook
- Gaim and Kopete instant messaging and IRC logs
- Firefox and Epiphany web pages (indexed as you view them, through a browser extension)
- Konqueror web pages
- Blam, Liferea (1.2) and Akregator RSS feeds
- Tomboy, KNotes, and Labyrinth notes
- Konversation IRC logs
- Opera Web History
- Empathy IM logs
- Live GMail Search
Beagle extracts text and metadata from the following file formats:
- Folders
- Office Documents
  - OpenOffice.org (sxw, sxc, sxi and more)
  - OpenDocument (odt, ods, odp)
  - Microsoft Office (doc, xls, ppt)
  - AbiWord (abw)
  - Scribus (sla)
  - Rich Text Format (rtf)
- Text Documents
  - HTML (xhtml, html, htm)
  - Source code (Boo, C, C++, C#, Fortran, Java, JavaScript, Lisp, Matlab, Pascal, Perl, PHP, Python, Ruby, Scilab and Shell scripts)
  - LaTeX, BibTeX
  - Plain text (txt, and any plain text file that does not fall under another category)
- Documentation/Help Documents
  - Texinfo
  - Man pages
  - gzip and bzip2 compressed man pages
  - Info pages
  - gzip and bzip2 compressed info pages
  - DocBook
  - Monodoc
  - Windows help files (chm)
- Images (jpeg, png, bmp, tiff, gif, svg)
- Audio (mp3, ogg, flac, ape, mpc, m4a, aac, tracker, amiga audio, wma)
- m3u and pls playlists
- Video (mpeg, asf, wmv, mng, mp4, quicktime and other formats supported by MPlayer or Totem)
- Archive files (zip, tar, gzip, bzip2) and their contents
- Application launchers
- Linux packages (ebuild, rpm, dpkg)
- Generic XSLT files
Beagle also allows users to write their own simple filters that rely on external programs. For example, one could use untex to extract text from TeX files. To create such a filter, add an entry for it to the external-filters.xml file. Instructions and a sample configuration can be found in the external-filters.xml.sample file.
For examples, or to share your own external-filters.xml, see the ExternalFiltersRepository.
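The idea behind an external filter can be sketched in a few lines of shell: a program is handed a file and whatever it prints to standard output is treated as the extracted text. This is only an illustration of the concept, not Beagle's implementation and not the external-filters.xml syntax (see external-filters.xml.sample for that). The function name `extract_text` is hypothetical, and the sketch assumes the external program accepts the file path as its last argument.

```shell
# extract_text FILE CMD [ARGS...]
# Hypothetical helper: run the external program CMD with ARGS followed by
# FILE, and print whatever text the program writes to standard output.
extract_text() {
    file=$1
    shift
    "$@" "$file"
}

# Example use (assumes untex is installed and paper.tex exists):
#   extract_text paper.tex untex
```

Any program that prints plain text for a given input file could be substituted for untex in the same way.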
To list the data sources supported by a Beagle installation, run the command
$ beagle-info --list-backends
To list the supported filters, run the command
$ beagle-info --list-filters
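To check for one particular backend or filter, the output of either command can be passed through grep. The sketch below assumes the listing mentions each entry by name somewhere in its output; the exact output format is not described here, and `has_entry` is a hypothetical helper, not part of Beagle.

```shell
# has_entry NAME
# Hypothetical helper: read a listing on standard input and succeed if any
# line contains NAME (case-insensitive, matched as a fixed string).
has_entry() {
    grep -qiF -- "$1"
}

# Example use (requires a Beagle installation):
#   beagle-info --list-backends | has_entry thunderbird && echo "Thunderbird backend found"
#   beagle-info --list-filters | has_entry pdf || echo "no PDF filter listed"
```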
