Text Mining Tool is a freeware program that extracts text from files of the following types:
pdf, doc, rtf, chm, html. It does not require any other programs, such as Word or Acrobat, to be installed.
The beauty of the program is that it works very simply on almost all common document formats.
That includes HTML web pages, the DOC and RTF formats used by Microsoft Word and by other programs such as
OpenOffice, Windows Help files ending in CHM, and portable documents in PDF format.
The following key features make it comfortable and easy to use:
- No payment or license restrictions. The tool is absolutely free.
- Works as a converter of PDF, DOC, RTF, CHM, and HTML files to text.
- User-friendly interface with hotkeys available.
- A console tool, minetext, is included for automating text conversion.
- Based on the .NET 2.0 framework.
- No installation is needed. Just unpack the program and use it.
Download details
Attention! If you do not have the .NET 2.0 framework installed, you must download it from this page.
Download Text Mining Tool (8594Kb)
Additional information
For convenience, the following hotkeys can be used to perform the operations:
- Open - F3 or O.
- Save - F2 or S.
- Clipboard - F5 or C.
- Exit - F10 or Escape.
The included console tool minetext, which can be helpful for developers or system administrators, has the following syntax:
minetext <input file>
minetext <input file> <output file>
where:
<input file> - any file with one of the following extensions:
pdf, doc, rtf, chm, htm, html
<output file> - file to which the text mined from the input file is written
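
As an illustration of how the two-argument form might be automated, here is a minimal Python sketch that converts every supported file in a folder. It is not part of the tool. It assumes that minetext can be found on the PATH and that a non-zero exit code means the conversion failed; neither is stated in the description above. The helper names (build_command, plan_batch, run_batch) and the ".txt" naming of output files are hypothetical choices made for this example.

```python
import subprocess
from pathlib import Path

# Extensions accepted as <input file>, as listed in the syntax description.
SUPPORTED = {".pdf", ".doc", ".rtf", ".chm", ".htm", ".html"}


def build_command(input_file, output_file, exe="minetext"):
    """Return the argument list for: minetext <input file> <output file>."""
    return [exe, str(input_file), str(output_file)]


def plan_batch(source_dir, target_dir):
    """Pair each supported file in source_dir with an output path in target_dir.

    The full input name is kept and ".txt" is appended, so report.pdf and
    report.doc do not end up writing to the same output file.
    """
    source_dir, target_dir = Path(source_dir), Path(target_dir)
    jobs = []
    for path in sorted(source_dir.iterdir()):
        if path.is_file() and path.suffix.lower() in SUPPORTED:
            jobs.append((path, target_dir / (path.name + ".txt")))
    return jobs


def run_batch(source_dir, target_dir, exe="minetext"):
    """Run minetext once per planned job and return the inputs that failed."""
    Path(target_dir).mkdir(parents=True, exist_ok=True)
    failed = []
    for src, dst in plan_batch(source_dir, target_dir):
        result = subprocess.run(build_command(src, dst, exe))
        # Assumption: a non-zero exit code signals an error.
        if result.returncode != 0:
            failed.append(src)
    return failed
```

A call such as run_batch("docs", "text") would then attempt to convert each supported file in docs and write the results into text. If the executable is not on the PATH, its full path can be passed through the exe parameter.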





