We are happy to announce version 7.1 of our popular Muhimbi PDF Converter API and Server Platform and OCR and PDF/A Archiving API and Server Platform. The main new features are support for OCR (Optical Character Recognition) to convert scanned documents into fully searchable and indexable PDF files, and a completely overhauled converter for the EML (email) format that should really benefit those organisations that don’t use MS-Outlook’s MSG format to store email.
- Improved support for the MSG & EML based formats.
- OCR Facilities provided by Muhimbi’s server based PDF Conversion products
- Carry out OCR using a web service call (.NET)
- Carry out OCR using a web service call (Java)
- Carry out OCR using a SharePoint workflow
A quick introduction for those not familiar with the product: The Muhimbi PDF Converter Services is an ‘on premises’ server based SDK that allows software developers to convert typical Office files to PDF format using a robust, scalable but friendly Web Services interface from Java, .NET, Ruby & PHP based solutions. It supports a large number of file types including MS-Office and ODF file formats as well as HTML, MSG (email), EML, AutoCAD and Image based files and is used by some of the largest organisations in the world for mission critical document conversions. In addition to converting documents the product ships with a sophisticated watermarking engine, PDF Splitting and Merging facilities, an OCR facility and the ability to secure PDF files. A separate SharePoint specific version is available as well.
Scanned Document with OCRed text selected
In addition to the changes listed above, some of the main changes and additions in the new version are as follows:
1901CADFixCAD Conversion - AccessViolationException1931CADImprovementCAD Converter does not resolve externally referenced files1850CADImprovementAdd support for AutoCAD 20131916ConversionFixTIFF to PDF Conversion uses dimensions of first page for all pages1853ConversionFixPost processing PDF generated from TIF as 'Screen Optimised' scrambles PDF676ConversionImprovementExcel Conversion - Add support for PDF/A1930Cross-conversionFixFolder with Temp files cannot be deleted when converting DOC to HTML for some locales / regions1879EMLNewImplement conversion of RFC2045 / RFC5322 based EML files1965HTMLFixHTML Converter hangs on 0.5 page margin1920HTMLFixNot all URLs are recognised by HTML Converter1827HTMLFixHTML to PDF Conversion for some non-Roman languages lose characters1840HTMLFixLast line is truncated when converting HTML to PDF1953HTMLImprovementMixed fonts in same sentence are vertically offset when converting HTML to PDF1940HTMLImprovementHTML Conversion doesn't convert unencoded quotes1884HTMLImprovementAdd configurable delay to HTML to PDF conversion for pages heavy on JavaScript / DHTML (e.g. pages containing Google Maps)2009InfoPathImprovementFix InfoPath forms colour being lost on IE10 systems1939InfoPathImprovementInfoPath does not export to PDF well on systems with IE102010MergingFixSystem.NullReferenceException when saving merged file2012MergingFixInternal hyperlinks are broken when merging documents1990MergingFixUnexpected token DictionaryEnd while merging1982MergingFixSystem.IndexOutOfRangeException: Index was outside the bounds of the array. while merging PDF1984MergingFixBookmark targets bottom of page1968MergingFixNullreference error in PdfLoadedFormFieldCollection.GetFieldType while merging1978MergingFixError in 'PdfLoadedPageCollection.GetPage' while merging file1967MergingFixBlank pages while merging1943MergingFixFatal Error at 9670 while merging1935MergingFixMerged file is empty when merging large bitmapped PDFs1895MergingFixFatal Error when merging1892MergingFixSystem.NullReferenceException when merging2007MSGFixMSG - Unexpected line break using plain text conversion2014MSGFixMSG - Unicode / character encoding problem in HTML email2006MSGFixMSG - Hyperlink breaks during conversion1958MSGFixMSG - System.Exception: compressed-RTF CRC32 failed1959MSGFixMSG/EML Converter - Last line is missing from some converted emails1925MSGFixMSG to PDF - Plain text email carriage return handling is incorrect1913MSGFixMSG to PDF - RTF HTML MSG - incorrectly converted accents / diacritics1914MSGFixMSG to PDF - RTF HTML MSG - RTL languages not converted in correct order1904MSGFixMSG to PDF - Sometimes Attachment is not processed1911MSGFixMSG to PDF - Possible regression on in-line images1912MSGFixMSG to PDF - RTF HTML MSG - Azerbaijani, Maltese - some unicode characters not converted, left as \uXXXX1899MSGFixMSG to PDF - German special characters are sometimes not properly converted1882MSGFixMSG to PDF - RTF email is missing portion of first line in body text1885MSGFixMSG to PDF - Handle and Memory leak when converting signed MSG files1862MSGFixMSG to PDF - Incorrect font1863MSGFixMSG to PDF - Numbered list items not rendered1601MSGImprovementMSG to PDF - Improve line spacing in HTML to PDF Conversion1660MSGImprovementMSG to PDF - Test / Implement remaining languages1917MSGImprovementMSG to PDF - RTF HTML MSG - some languages causing small fonts1903MSGImprovementMSG to PDF - Implement Best Body Algorithm from MS-OXBBODY specification1881MSGImprovementMSG to PDF - Text opaque signed MIME messages lose formatting2015MSGNewMSG to PDF - Include email address in 'To' field995OCRNewOCR - Add support for OCR of PDF data to allow searchable PDFs1985OtherFixCannot set PDF Creator / Processor meta data for some files1972OtherFixLoading a PDF 1.7 document into a PDFDocument resets it to PDF 1.51952OtherFixCertain PDFs do not permit viewerpreferences to be read1906OtherFixOccasional Access Denied in Task Monitor on Win2K12 / InfoPath 20151799OtherImprovementUpgrade to .net 3.52061ProFixConverting between PDF Versions on a locale that uses ',' as a decimal separator sets the PDF Version to 1.11945ProFixPDF/A conversion - The DateTime represented by the string is not supported in calendar System.Globalization.GregorianCalendar.1922ProFixRe-processing existing PDF/A files for PDF/A output fails1909ProFixPDF/A Conversion fails when certain characters occur in the PDF Title1888ProFixImprove reliability of PDF/A2b conversions1849ProFixLinearization in combination with PDF/A fails1979ProImprovementAlways post process for PDFA when _outputFormatSpecificSettings.PostProcessFile == true1843ProImprovementAllow transparent content in PDF/A2b documents1974SecurityFixWhen security is removed from PDF files its contents still shows as encrypted
For more information check out the following resources:
- Product Page.
- Brochure.
- Release Notes.
- Administration Guide.
- User & Developer Guide.
- FAQ & Knowledge Base.
- Discussion Forum.
- All PDF Converter related Blog Posts.
As always, feel free to contact us using Twitter, our Blog, regular email or subscribe to our newsletter.
Download your free trial here (37MB). .
Labels: EML, MSG, News, OCR, pdf, PDF Converter Professional, PDF Converter Services