White Paper: Getting the Most of Your ABBYY Data Capture System
When you think about a data capture system, what goes through your mind? Maybe you think about the reduction in paper you will realize, easier access to content, or more space due to fewer filing cabinets. While these may be worthy thoughts, a feature-rich and correctly utilized data capture system can be so much more.
Capture More Than Paper
When most people think of data capture, they immediately think of capturing paper documents. While capturing paper documents and the benefits that come with it are important (fewer filing cabinets, less clutter, a ‘green’ initiative), capturing documents other than paper is increasingly just as important. For instance, capturing emails, which have proliferated in recent years, has become a segment of enterprise content management (ECM) unto itself. Articles and best practices on email management abound, and a data capture system that allows for the capture of emails is an important part of that email management movement. Utilized correctly, a good data capture system will allow email inboxes to be monitored, emails and their attachments to be captured, attachments and their associated pages to be extracted, and notifications to be configured throughout the whole email process. Although faxes are fewer in number than in the past, a data capture system should also have the ability to capture incoming faxes. By capturing fax documents directly from one or more fax servers, you should no longer have to scan paper fax printouts unless you really just enjoy doing that. Finally, the capture of electronic files should be supported as well. A well-designed data capture system will allow you to capture all types of images, indexed documents, and other files (such as Word documents, Excel spreadsheets, etc.) from a file system. So besides capturing paper, get the most out of your data capture system by capturing the rest of the document world that resides in emails, faxes, and electronic files.
Easing The Document Processing Bottleneck
Executive Summary
Most organisations already employ digitisation systems for the purpose of processing and storing information in multiple electronic file formats which are then accessible by a wide range of software applications. But few have yet implemented formal procedures and technologies for controlling the process, and systems to centralise document scanning, capture, data classification and distribution remain the exception rather than the norm.
Based on a Computing survey of almost 200 ICT and finance managers, this white paper provides a detailed snapshot of how businesses capture and manage both paper-based and electronic information coming into their organisation, and identifies whether certain types of data and document formats are easier to process than others.
It also looks at the problems that inefficient document management systems can create for employees and business partners by causing data processing delays or obscuring mission-critical information, and examines the drivers behind any appetite for document management system upgrades or implementation within individual companies.
The Document Mountain: As Broad as it is High
The range of documents that finance departments deal with is broad, with invoices, purchase orders and expenses forms being regularly processed in the majority of cases, followed by cheques, tax forms, contracts and other types of non-specified remittances (Fig. 1). Others included SAP reports, timesheets, BACS/Swift payments, credit card reports and wage slips.
ABBYY FlexiCapture White Paper
Automated data input technologies have a relatively long history - dating back to the days when the first optical reading systems were developed to recognize stylized symbols drawn according to templates. Since that time, they have evolved to support a vast industry, utilizing a large set of very different technologies.
The traditional machine-readable form processing technologies of today are well-established. A wide choice of systems capable of processing many types of machine-readable forms is now available. Today's advanced systems can accurately capture machine-printed and handwritten characters and process thousands of documents per day. ABBYY FormReader is one of the leading products in the field, capable of handling both printed and hand-printed forms (see http://www.formreader.com or contact ABBYY for a whitepaper and additional information on ABBYY form processing technology).
Yet while today's form processing systems are very advanced, they are still limited in functionality. For example, processing semi-structured documents, or forms and documents on which the sizes and locations of fields containing key pieces of data vary from document to document, still remains the most challenging task in data capture. While the demand for solutions to address this area is extremely high, form processing programs have not been flexible and intelligent enough to process these types of documents without extensive customization and system training. An easy-to-deploy, cost-effective solution for processing such documents as invoices, order forms, legacy forms, and template-based contracts has been, until now, virtually out of reach for a large audience.
For these types of cases, even when full-text documents are being handled, the ultimate aim is to extract a particular set of fields, or key pieces of information, from a given page. We will refer to such documents as flexible forms.
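To make the idea of a flexible form concrete, here is a minimal sketch of locating fields by nearby labels rather than by fixed page coordinates. It is only an illustration under stated assumptions: the field names, the regular expressions, and the `extract_fields` helper are hypothetical and do not represent how ABBYY FlexiCapture works internally.

```python
import re

# Hypothetical field definitions: each field is found by a label that
# appears near the value, not by a fixed position on the page.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"invoice\s*(?:no\.?|number|#)\s*[:\-]?\s*([A-Z0-9\-]+)", re.IGNORECASE
    ),
    "date": re.compile(
        r"date\s*[:\-]?\s*(\d{1,2}[./-]\d{1,2}[./-]\d{2,4})", re.IGNORECASE
    ),
    "total": re.compile(
        r"total(?:\s+due)?\s*[:\-]?\s*[$€£]?\s*([\d,]+\.\d{2})", re.IGNORECASE
    ),
}


def extract_fields(text):
    """Return a dict mapping each field name to its value, or None if absent."""
    result = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result


# Two recognized pages with the same fields in different places and wording.
layout_a = "ACME Corp\nInvoice No: INV-1001\nDate: 12/03/2009\nTotal: $1,250.00\n"
layout_b = "Total due 980.50\nShipping details\nINVOICE # A77\nDate - 01.02.2010\n"

fields_a = extract_fields(layout_a)
fields_b = extract_fields(layout_b)
```

Both layouts yield the same set of fields even though their order and labels differ, which is the property that distinguishes flexible forms from fixed-position forms.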
ABBYY® Aligner: A Simple and Convenient Solution for Fast Creation of Translation Memory Databases
ABBYY® Aligner is a tool for aligning parallel texts in different languages and creating Translation Memory (TM) databases. ABBYY’s 20 years of experience developing linguistic software and other products are reflected in the unique advantages offered by ABBYY Aligner:
• Based on ABBYY’s advanced linguistic technology, ABBYY Aligner ensures excellent quality of parallel text alignment
• The software enables processing of texts in various languages
• ABBYY Aligner supports batch processing for quick automatic alignment of large volumes of documents
• ABBYY Aligner has a simple and convenient interface, installs on user computers in a few quick and easy steps and is ready for use right out of the box
• The texts processed with ABBYY Aligner can be exported into TMX format, facilitating compatibility with any CAT tools or other systems designed to handle TM databases
All this makes ABBYY Aligner an indispensable tool for translators, language service providers, corporations and all who are involved in managing translation activities, allowing them to improve the quality of translations, increase translation speed, minimize expenses when working with translation agencies, and reduce the workload of translation departments.
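For readers unfamiliar with TMX, the following is a minimal sketch of what a TM database in that format can look like, built with Python's standard library. It does not reproduce ABBYY Aligner's actual export: the `build_tmx` helper and the header values (such as the `example-script` tool name) are assumptions chosen for illustration, and only the element names follow the TMX 1.4 structure.

```python
import xml.etree.ElementTree as ET

# The xml:lang attribute lives in the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def build_tmx(pairs, src_lang, tgt_lang):
    """Serialize aligned (source, target) segment pairs as a TMX string."""
    tmx = ET.Element("tmx", {"version": "1.4"})
    # Header attribute values below are placeholders for this sketch.
    ET.SubElement(tmx, "header", {
        "creationtool": "example-script",
        "creationtoolversion": "0.1",
        "segtype": "sentence",
        "o-tmf": "plain-text",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in pairs:
        # One translation unit (tu) per aligned pair, one variant (tuv) per language.
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")


tmx_doc = build_tmx([("Hello", "Hallo"), ("Thank you", "Danke")], "en", "de")
```

Because each aligned pair becomes a self-contained translation unit, a file like this can be loaded by CAT tools that read TMX, which is the compatibility benefit described above.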
Advantages of ABBYY Aligner:
Saves Time
The high quality of parallel text alignment, the convenient and intuitive interface, and special functions for processing large volumes of documents help to achieve quality results quickly and re-use them in other translations. Using the TM database created with ABBYY Aligner when working on new translations helps to complete the job in less time.
ABBYY USA Marks 10-Year Anniversary
Milpitas, Calif., August 10, 2010 — ABBYY USA, a leading provider of document recognition, data capture and linguistic software, is celebrating its 10-year anniversary in the U.S., marking its leadership position in enterprise content management and document management markets. This also marks the 21-year anniversary for its parent company, ABBYY.
Since its beginning, ABBYY USA and its parent company have driven growth within the document management and data capture industries through product innovation and successful partner integrations, expanding the usability and mass market attention for these technologies. Through ABBYY USA’s efforts, capture accuracy has increased to help people manage an increasing amount of information; the market has expanded to customers that span a variety of industries including government, finance, legal and healthcare; and technologies are consistently evolving to leverage new consumer devices bringing added value to applications.
New Web Portal to Provide Document Conversion and Language Services
Moscow, Russia (July 15, 2010) — ABBYY, a leading provider of document recognition, document capture, and linguistic technologies and services, today announced its new Web portal http://www.abbyyonline.com/ designed as a central access point to key services and technologies provided by ABBYY. The portal contains FineReader Online, an online OCR (optical character recognition) and document conversion service for transforming images of documents and PDF files into DOC, XLS, TXT, searchable PDF and other text formats. The new portal also delivers a range of language services including Lingvo Online dictionary, translation and telephone interpreting from ABBYY Language Services, and the Aligner Online tool for translators and language learners.
“We are proud to bring ABBYY’s flagship OCR and linguistic technologies online. Online services are easily accessible regardless of operating system, computer configuration, and many other factors. The result is that many more people can learn the benefits of our technologies,” explained Maxim Mikhaylov, vice president of sales and marketing for ABBYY. “ABBYY has been testing the technologies and is already hosting hundreds of thousands of people to the online services each day. We hope to further enhance this online platform with additional services and features.”
Currently, ABBYY Online offers its visitors the following services:
Online OCR and Document Conversion at FineReaderOnline.com: The service converts scanned or photographed images of documents (e.g. JPG, TIFF, DjVu and others) and PDF files into DOC, RTF, XLS, searchable PDF, and TXT formats. FineReader Online is based on ABBYY OCR technology that has received international acclaim from industry experts and media for superior accuracy of text recognition and layout retention. It accurately reads texts in 37 languages including documents with Latin, Cyrillic, Armenian, and Greek characters, and supports recognition of multilingual and multi-page files. In addition, FineReader Online processes documents with any combination of popular fonts and accurately re-creates formatting elements such as bulleted and numbered lists, columns, and tables.
ABBYY FlexiCapture Engine 8.0 Is Named Recognition/Data Capture Product of the Year 2009
Munich, Germany (17 November, 2009) – ABBYY, a leading provider of document recognition, data capture and language software, is pleased to announce that ABBYY FlexiCapture Engine 8.0 was recently awarded the prize “Recognition/Data Capture Product of the Year” at the Document Manager Awards 2009. Document Manager is a UK-based publication dedicated to addressing the key issues behind successfully implementing document management, content management, workflows and e-business solutions.
“It is an honour to be recognized specifically for FlexiCapture Engine and an acknowledgement of the hard work by the UK team and ABBYY as a whole,” noted Jupp Stoepetie, ABBYY Europe CEO. “Thanks go to Document Manager Magazine for hosting an exceptional event as well to all of the members of the industry who voted for FlexiCapture Engine.”
ABBYY FlexiCapture Engine 8.0 is a software development kit (SDK) for integrating data and document capture technologies in Windows-based applications. It is the first comprehensive data capture SDK to combine technologies and tools for processing forms, semi-structured and unstructured documents, data verification, document classification, and export for backend processing and archiving in a single developer environment. The new ABBYY SDK enables developers, ISVs and service providers to empower their own products and services with industry-leading technologies already proven in thousands of real-world projects.
ABBYY Announces Its First Dedicated Solution for Document Scanning
Moscow, Russia (October 1, 2009) – ABBYY, a leading provider of document recognition, data capture and language software, today announced that ABBYY Scan Station, its new software solution for batch document scanning, is now available in the CIS countries and Eastern Europe. ABBYY Scan Station enables scanning and export of hundreds of pages within minutes and provides a wide range of useful tools for enhancing the quality of scanned images. It can be used for capturing and digitizing large document archives as well as for scanning small numbers of documents with a standard workgroup scanner. In addition, organizations with regional offices and subsidiaries can easily deploy a distributed document capture system, with ABBYY Scan Station installed in different locations and with centralized document processing and storage.
With its intuitive interface, ABBYY Scan Station is easy to operate and navigate even if the operator has little or no experience in document scanning. At the same time, it is a professional solution based on powerful scanning technology that has been successfully used in leading ABBYY products such as ABBYY FineReader, ABBYY FormReader, and ABBYY FlexiCapture.
“More and more companies are moving to paperless environment to streamline document processing and business operations”, said Aram Pakhchanian, Director of Data Capture Products Department at ABBYY. “The first step in this process is to organize fast and convenient document scanning. We believe that ABBYY Scan Station, which is highly productive yet very simple to use, will help even small companies take this step easily”.
ABBYY Software and Fujitsu Canada Announce New Distribution Agreement
MILPITAS, CALIF., August 27, 2009 – ABBYY Software, a leading provider of document recognition, data capture and linguistic software, has signed an agreement with Fujitsu Canada to distribute a full range of ABBYY products, including ABBYY FineReader®, FlexiCapture®, FormReader®, PDF Transformer™ and Recognition Server™, alongside Fujitsu Canada's complete line of document scanners. Currently, ABBYY's TouchTo optical character recognition (OCR) technology is integrated into the Fujitsu fi-6010N iScanner.
"With the growing demand to convert paper into searchable format, the combination of our industry leading scanners and ABBYY's award winning OCR software provides the perfect solution," said Steve Oblin, senior marketing manager, Imaging Products at Fujitsu Canada. "Initial feedback from our resellers has been very positive and several have already become certified on ABBYY's products."

