I have a computer book price comparison website, and I need more ISBNs for my webscraping programs to check, grab descriptions, and harvest. Our site is a little different in the fact that we scrape and store prices instead of looking them up when the user requests them. Right now I have a list of about 24,000 ISBNs (which will be made available to you after you bid and win the auction so as to help avoid duplicates). Here's what I want: 200,000 isbns (or more if you are able to get them - I will bonus the coder appropriately depending on results) of computer and technical-related books. No duplicates inside the set. You can have duplicates of my data, if it can't be avoided. 10 digits, no dashes, uppercase 'X's (when encountered). One ISBN on each line. I'm a perl scripter/linux geek, so I can clean up your results, but I would prefer if you clean them up before turning your results in. Here's a little example data: 0596000006 0596000022 0596000030 0596000057 0596000065 0596000081 059600009X 0596000103 0596000111 059600012X This is what I expect the results to look like. If you don't know what I'm talking about, please don't bid. Deliverables: ISBN list, newline delimited, no dashes, uppercase 'X's, please. No duplicates in delivered set. If you can provide a perl webscraping script that will find new ISBNs for computer and technical books, tell me and we'll talk a deal (via bonus on rentacoder). Perl for linux preferred (to work on my dedicated Fedora Core 1 hosting box). (And yes, I can install CPAN modules on this box as root.) On delivery of list, I will need a day or two to double check your results. I work full-time as a computer programmer, 8-5pm CST, so please keep this in mind. If you have any questions, please ask before bidding.
## Deliverables
Rent A Coder requirements notice: As originally posted, this bid request does not have complete details. Should a dispute arise and this project go into arbitration "as is", the contract's vagueness might cause it to be interpreted against you, even though you were acting in good-faith. So for your protection, if you are interested in this project, please work-out and document the requirements onsite.
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
text file