Fix OCR Errors and Add Formatting to Word Documents

I am looking for help with formatting Word documents that came from converted PDF files. You are not expected to proofread the documents, ie fix errors that appear in the original PDF. You are only expected to fix errors that were created by OCR program. So if there is a typo or grammar mistake in the original, it should be left as-is in the final product.

IMPORTANT: in your offer ONLY provide:

1.a FLAT FEE quote for the completion of this project; and

2. the amount of time it will take you to complete the project from the date of assignment.

You will be provided with the original PDF files and the Word documents generated from them. The Word files have errors generated in the course of the OCR process. The following steps need to be completed:

1) correct any typos by comparing the Word document with the PDF original – especially paying attention to those parts of the PDFs that are covered by stamps (most errors are highlighted in blue, but you still need to review the rest);

2) indent all quotes and lists (1") - and make sure no other text is indented;

3) Remove colons from the end of the headings

4) make sure that all headings are capitalized as headings - Capitalize *First* Letters of All Words Except Articles (a, an, the), Coordinating conjunctions (and, but, or, nor, for, so, yet) and short prepositions (in, on, at, to, by, of, for, as, up)

5) accurately assign headings 1, 2, 3, 4 to headings;

6) replace numbers referencing footnotes in the body of the documents with [FN#];

7) list footnotes at the end of the document, starting with [FN#], and add "Footnotes" (styled as Heading 1) before the list of footnotes;

8) replace all curly quotes with straight quotes, and make sure that single quotes are *only* used inside words, i.e. use double straight quotes for all quotes;

9) do not use automated paragraph numbers – paragraph numbers should be in the format of 9.[space] (i.e. not 9.[tab])

Once you are done correcting errors format the documents as follows:

1) add an empty line after each paragraph and after each bullet/number point;

2) make sure the font Aptos 12" throughout

3) make sure all text is single space (nil spacing before and after paragraph)

4) make sure headings are formatted as follows: Heading 1: Aptos 12" bold CAPS, Heading 2: Apto 12" bold; Heading 3: Aptos 12" Italics underlined.

5) make sure no text is highlighted

I am attaching a sample for your consideration. Please note that while most files are OK quality, there are a few files that are not.

This project involves the following:

• 35 files in English (841 pages);

• 16 files in French (434 pages);

• 2 files in Portuguese (14 pages);

• 1 file in Arabic (66 pages).

Please provide a preliminary quote. Shortlisted contractors will be sent all of the documents to provide a final quote.

IMPORTANT: You are expected to complete the whole set, i.e. no payment will be made for a portion of the set, because it costs a lot more to hire someone to do a small set of documents.

Back to blog