Extracting Metadata And Structure
Project sponsored by DTIC, GPO, and NASA
DELIVERABLES FOR DTIC PHASE IIIA
  1. Enhanced Portability
    1. Removal of system dependencies - Software upgrade - Dec 1, 2007
      Package and README file (Version 3.2) - Jan 10, 2008
    2. Documentation for non-technical staff - Operational manual - Nov 1, 2007
      Installation instruction manual - Last updated December 5, 2007
    3. Improved Error Logging - Software packaged - Dec 1 (revised: Jan 1, 2008)
    Due to the delayed start, we had to shift the deliverables under this item to Jan 1, 2008.
  2. Standardized output
    1. Process extracted metadata into standard format, Software upgrade and report - Jan 1, 2008
  3. Templates for 'Large contributors'
    1. Enhanced Testbed (internal, no deliverable) - Dec 1, 2008
    2. Engine enhancement development, Design and development report - Jan 1, 2008
    3. Template development, Template set - Feb 1, 2008
  4. Prime OCR
    1. Write module to convert Prime OCR to IDM, Software upgrade - April 1 (revised: June 30), 2008
      Text PDF support development - Status report, IDM generator, and README
    Our current approach is to work on the text PDF task (item 2 of Phase IIIB) in parallel with this task, since a text PDF module will allow us to solve the same problem for other OCR software as well.
  5. Output Processing
    1. Metadata Validation, Software update - March 1, 2008
    2. Standardized Output, Software update - April 1 (revised: June 30), 2008, described in report
    NOTE: Due to problems with the initial start, we had to change the dates for deliverables 4A and 5B (items 4.1 and 5.2 above) to June 30.
MONTHLY REPORTS

Old Dominion University Digital Library Group. extract@cs.odu.edu