## Introduction

Our primary interest is improving the searchability of the 721 PDF documents saved on our site. On a different site that archives the same documents, [login to view URL]~ota, users can search the PDF versions of the reports only alphabetically by title, alphabetically by predetermined keyword, and by year of publication. We want to expand this capability so that users can still search with these tools but can also search the text of the reports, the report number, etc.

We are definitely interested in hearing about the type of system you suggest and the price of such a system. Below please find a request for the next phase of our project. Let us know what suggestions you have on how to improve our project.

Sincerely,
otaarchive

## Request

We are looking for someone to create and populate a database using PHP and MySQL. The database will provide index information for a series of 721 reports (saved in .PDF format). Once the database is built, a second request will be posted to build a search engine function to search the database, retrieve report matches, and display the matching PDF files. The data will be built on the following website: [login to view URL]
## Deliverables
The tables should be MyISAM and use MySQL’s native full-text search functions for the Title, Summary, and Table of Contents. The developer should include pagination-ready sample search queries that return the matching text when the Title, Summary, or Table of Contents is searched. For full-text searches that return no results, a second search using query expansion should be performed automatically and transparently.
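As a starting point for discussion, the MyISAM table, the paginated full-text query, and the query-expansion fallback might look roughly like the sketch below. The table name `reports`, every column name, the column sizes, the page size of 20, and the search phrase `'solar energy'` are assumptions made for illustration only; the final design is up to the developer.

```sql
-- Hypothetical table covering the eight fields requested for each report.
CREATE TABLE reports (
    report_id      INT UNSIGNED      NOT NULL AUTO_INCREMENT,
    title          VARCHAR(255)      NOT NULL,
    ota_catalog_no VARCHAR(32)       NULL,      -- not present on every report
    gpo_stock_no   VARCHAR(32)       NULL,      -- not present on every report
    ntis_order_no  VARCHAR(32)       NULL,      -- not present on every report
    release_year   SMALLINT UNSIGNED NOT NULL,
    pdf_file       VARCHAR(64)       NOT NULL,
    summary        TEXT,
    toc            MEDIUMTEXT,                  -- table of contents
    PRIMARY KEY (report_id),
    UNIQUE KEY uq_pdf_file (pdf_file),
    KEY idx_release_year (release_year),
    -- MATCH() must list exactly the columns of one FULLTEXT index.
    FULLTEXT KEY ft_title_summary_toc (title, summary, toc)
) ENGINE=MyISAM DEFAULT CHARSET=utf8;

-- Sample search, page 1 with 20 rows per page.
-- For page p, use OFFSET (p - 1) * 20.
-- The searched columns are returned so the caller can display the matching text.
SELECT report_id, title, release_year, pdf_file, summary, toc,
       MATCH(title, summary, toc) AGAINST ('solar energy') AS relevance
FROM reports
WHERE MATCH(title, summary, toc) AGAINST ('solar energy')
ORDER BY relevance DESC
LIMIT 20 OFFSET 0;

-- Fallback, run by the calling code only when the query above returns no rows.
SELECT report_id, title, release_year, pdf_file, summary, toc,
       MATCH(title, summary, toc)
           AGAINST ('solar energy' WITH QUERY EXPANSION) AS relevance
FROM reports
WHERE MATCH(title, summary, toc)
      AGAINST ('solar energy' WITH QUERY EXPANSION)
ORDER BY relevance DESC
LIMIT 20 OFFSET 0;
```

When testing, keep in mind that MyISAM natural-language searches ignore words that appear in half or more of the rows and, with default settings, words shorter than four characters, so a very small test table can return no matches even when the text contains the search term.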
To summarize, the coder will:
1) design the database
2) create a PHP form for adding data to the database
3) add all the requested data to the database
To populate the MySQL database:
A) Identify all the files named [login to view URL] through [login to view URL]
B) For each PDF file, add the following information to the database.
1) Report title (in italics at the top of the first page)
2) OTA catalog number (on the first page; used only between 1990 and 1995)
3) GPO stock # (on the first page; used only between 1992 and 1995)
4) NTIS order # (on the first page; used between 1974 and 1994)
5) Year of release (the first two digits of the file name; for instance, [login to view URL] refers to a report published in 1974, [login to view URL] refers to 1975, etc.)
6) PDF file name
7) Summary
8) Table of contents
C) For each report ([login to view URL], [login to view URL], etc.), the summary and table of contents to be added to the database can be found at the following links:
1) Document summary at [login to view URL]
2) Table of contents at [login to view URL]
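
The year-of-release rule in step B can be applied directly in MySQL while loading the data. The sketch below is one possible way to do it; the file names `74_example.pdf` and `75_example.pdf` are made up for illustration (the real file names are not reproduced here), and adding 1900 assumes that every report was published in the twentieth century, which matches the 1974 to 1995 ranges given above.

```sql
-- Derive the four-digit year from the first two characters of the file name.
-- The file names below are hypothetical placeholders, not the real ones.
SELECT f.pdf_file,
       1900 + CAST(LEFT(f.pdf_file, 2) AS UNSIGNED) AS release_year
FROM (
    SELECT '74_example.pdf' AS pdf_file
    UNION ALL
    SELECT '75_example.pdf'
) AS f;
-- 74_example.pdf gives 1974; 75_example.pdf gives 1975
```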
## Platform
any