I need a program that will download an all-text? PDF file and parse the? text in that file, then upload the fields? into a database.
This program must run on a shared hosted server and must be programmed in ASP.net and C#.? The C# files must be separate files.? The database is SQL Server 2008.? I also must receive the source code.
I use [login to view URL] as my hosting provider and they allow me to schedule page requests, so while I need this program to run daily, it is not important to build the scheduling into the program.
## Deliverables
The file that needs downloaded is located here:
<ftp://[login to view URL]>
the program should do the following when the web page is accessed:
1) check the above file on the ftp server and see if the date/time has changed since last time the file was downloaded
2) if it has changed, then download the file (otherwise, do not download just stop)
3) after downloading the file, parse the text in the PDF file page by page
4) at the top of each page (line 6) is listed a scheduled date.? record that date for the page.
5) parse the rest of the page into the following fields: case number, plaintiff, defendant, comments
6) write to a database table for each record the fields recorded in step 5 plus the date & time collected in step 4
7) run a report that shows all the data collected in an HTML table
8) create a concatenated link from a querystring i will provide to turn the following fields into hyperlinks: case number, plaintiff, defendant.? the links should open in new windows.
9) email the same file to email addresses i provide.
10) provide me with an "admin" page that allows me to change program options like smtp servers, email logins, passwords, email addresses to send to, etc.
any other questions with regard to functionality, please ask.