About Corrupt Office Suite Text/Data Extracting Service - BETA
My name is Paul Pruitt and my company is S2
Services. This service is an outgrowth of the S2 Services Data Recovery Freeware
List, which I have been maintaining since 2003.
With Office 2007, Word, Excel and PowerPoint started using a new
default file format. Despite their various extensions (docx, xlsx,
pptx), these files are really zipped collections of XML files. You
can see this by changing any of these file extensions to zip and
opening the file with your favorite unzipper or Windows Explorer's
built-in unzipper.
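The same check can be done without renaming anything. The following is a minimal Python sketch, assuming only the standard zipfile module; the helper names list_package_parts and build_demo_docx are hypothetical, and the demo archive is a stand-in with two placeholder parts rather than a complete Word document.

```python
import io
import zipfile


def list_package_parts(source):
    """Return the names of the files stored inside an Office 2007+ package.

    `source` may be a path (e.g. "report.docx") or a binary file-like object.
    """
    with zipfile.ZipFile(source) as package:
        return package.namelist()


def build_demo_docx():
    """Build a tiny docx-like zip in memory (demo data, not a valid document)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as package:
        package.writestr("[Content_Types].xml", "<Types/>")
        package.writestr("word/document.xml", "<w:document/>")
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    # Swap build_demo_docx() for the path of a real docx, xlsx or pptx file.
    for name in list_package_parts(build_demo_docx()):
        print(name)
```

Pointing list_package_parts at a real docx should show word/document.xml among the parts, which is the file the text of the document lives in.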
This format has made data recovery easier. Most corruption
issues with these new files are due to corruption of the zip
container. The various Office programs do a pretty good job of
recovering corrupt files, but they sometimes fail when the corrupted
part of the zip is one of the "content" XML files. For instance, the
text in a Word 2007 or 2010 docx file is found in the document.xml
file, and if this is the part of the zip that is corrupted, it will
sometimes trip up Word. My service does a little better at recovering
data from the content files than MS Office itself does.
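To illustrate the general idea, here is a rough Python sketch of one way to pull text out of a docx whose zip structure is damaged. It is not the code behind this service. It assumes the damage has left the entry's local header intact, scans the raw bytes for that header, inflates whatever it can, and collects the contents of the w:t text runs with a regular expression instead of a strict XML parser. All function names are hypothetical.

```python
import html
import io
import re
import struct
import zipfile
import zlib

LOCAL_HEADER = b"PK\x03\x04"      # signature that starts every zip entry
HEADER_FORMAT = "<4sHHHHHIIIHH"   # fixed 30-byte local file header
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)
TEXT_RUN = re.compile(r"<w:t(?:\s[^>]*)?>([^<]*)")  # text inside <w:t> runs


def inflate_tolerant(data, chunk_size=1024):
    """Inflate a raw deflate stream, keeping whatever decodes before an error."""
    inflater = zlib.decompressobj(-zlib.MAX_WBITS)  # raw deflate, no zlib header
    recovered = bytearray()
    for offset in range(0, len(data), chunk_size):
        try:
            recovered += inflater.decompress(data[offset:offset + chunk_size])
        except zlib.error:
            break  # damaged from here on; keep what was decoded so far
        if inflater.eof:
            break
    return bytes(recovered)


def salvage_entry(raw, wanted_name):
    """Find `wanted_name` by its local header, ignoring the central directory."""
    position = raw.find(LOCAL_HEADER)
    while position != -1:
        header = raw[position:position + HEADER_SIZE]
        if len(header) < HEADER_SIZE:
            break
        fields = struct.unpack(HEADER_FORMAT, header)
        method, packed_size = fields[3], fields[7]
        name_len, extra_len = fields[9], fields[10]
        name_start = position + HEADER_SIZE
        name = raw[name_start:name_start + name_len].decode("utf-8", "replace")
        data_start = name_start + name_len + extra_len
        if name == wanted_name:
            if method == 8:   # deflated
                return inflate_tolerant(raw[data_start:])
            if method == 0:   # stored without compression
                return raw[data_start:data_start + packed_size]
            return b""        # other compression methods are not handled here
        position = raw.find(LOCAL_HEADER, position + 4)
    return b""


def extract_text(xml_bytes):
    """Collect the text runs of each paragraph, tolerating truncated XML."""
    xml = xml_bytes.decode("utf-8", "replace")
    paragraphs = []
    for chunk in xml.split("</w:p>"):
        text = "".join(TEXT_RUN.findall(chunk))
        if text:
            paragraphs.append(html.unescape(text))
    return "\n".join(paragraphs)


def build_demo_docx():
    """Return the bytes of a tiny docx-like zip (demo data only)."""
    body = ('<w:document><w:body><w:p><w:r><w:t>Hello</w:t></w:r>'
            '<w:r><w:t xml:space="preserve"> world</w:t></w:r></w:p>'
            '<w:p><w:r><w:t>Second &amp; last</w:t></w:r></w:p>'
            '</w:body></w:document>')
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as package:
        package.writestr("word/document.xml", body)
    return buffer.getvalue()


if __name__ == "__main__":
    raw = build_demo_docx()
    damaged = raw[:raw.rfind(b"PK\x01\x02")]  # chop off the central directory
    print(extract_text(salvage_entry(damaged, "word/document.xml")))
```

The demo removes the zip's central directory, which is enough to make Python's own zipfile module reject the file, yet the text is still reachable through the local header. If the compressed data itself is cut short, inflate_tolerant returns only the portion that decoded before the damage, so the recovered text may end partway through the document.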
The service is possibly insecure and does not
have security certification. Please do not upload
security-sensitive files. Instead, see the links
section for freeware and open source software that does the same
extracting, or check out the list in the next paragraph.
For secure local work on your own computer, try the
freeware RepairMyWord
(only works with Word 97-2003 files),
Damaged docx2txt,
Corrupt xlsx2csv, or Corrupt Office
Extractor. Command-line options include SILVERCODERS
DocToText, Ccy's Corrupt
Office 2007 Extractor Command-line, and Sandeep Kumar's docx2txt.
If none of these free methods work, or you need to fix or extract
text from older Excel or PowerPoint 97-2003 format files, I
recommend trying the demos for the commercial
WordFix or
ExcelFix. For PowerPoint recovery, or for other commercial recovery
software and service vendors, try these links: Corrupt File
Recovery Services and Corrupt
File Commercialware.