I have a few .csv files (and one .sql file) that need to be entered into a MySQL database. Here is what I need:
**- .csv cleaning**
The .csv files need a lot of cleaning, as they were created by crawlers; many pages were not crawled properly, so you will end up with empty cells if you try to update the database with the files as they are.
**- Enter all .csv files into one MySQL database with the following columns, removing duplicates:**
| Title | Body | Author | Source |
That's it! It should be pretty easy for a coder (manually cleaning the data and putting it into a database takes too much time for me).
Happy bidding!
P.S. I'll be happy to give a bonus if the chosen coder can add a script once the project is completed (you suggest the functions). For example: a script to add new articles, or to remove non-alphanumeric characters, etc.
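To illustrate the kind of bonus script mentioned above, here is a minimal Python sketch that removes non-alphanumeric characters from a text field. The function name `strip_non_alphanumeric` is hypothetical, and the sketch assumes that letters and digits from any alphabet should be kept and that whitespace between words should be preserved as single spaces; the actual requirements may differ.

```python
import re

# Matches any single character that is not a letter, digit, or whitespace.
# The underscore is listed separately because \w would otherwise keep it.
_NON_ALNUM = re.compile(r"[^\w\s]|_")


def strip_non_alphanumeric(text: str) -> str:
    """Drop punctuation and symbols, then collapse runs of whitespace."""
    cleaned = _NON_ALNUM.sub("", text)
    return " ".join(cleaned.split())
```

For example, `strip_non_alphanumeric("Hello, world!")` returns `"Hello world"`.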
**UPDATE:**
The .sql file does not need cleaning :)
The .csv files are quite big (13 MB, 16 MB, 17 MB and 152 MB), so they cannot be sent through email.
Just to make sure, "cleaning" only means removing data that is missing or in an incorrect format.
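As a rough illustration of this cleaning rule combined with the duplicate removal, here is a minimal Python sketch. It makes several assumptions that may not hold for the real files: each .csv has a header row with columns named Title, Body, Author and Source; a row counts as bad if any of those four cells is empty or missing; and two rows are duplicates only when all four fields match after trimming whitespace. The function name `clean_rows` is hypothetical.

```python
import csv
from typing import Iterable, Iterator, TextIO

# Assumed column names, taken from the table layout in this posting.
COLUMNS = ("Title", "Body", "Author", "Source")


def clean_rows(files: Iterable[TextIO]) -> Iterator[tuple]:
    """Yield unique, complete (Title, Body, Author, Source) tuples."""
    seen = set()
    for f in files:
        for record in csv.DictReader(f):
            # Cells missing from a short row come back as None;
            # normalise them to "" and trim surrounding whitespace.
            row = tuple((record.get(col) or "").strip() for col in COLUMNS)
            if not all(row):
                continue  # skip rows with any empty cell
            if row in seen:
                continue  # skip exact duplicates, across all files
            seen.add(row)
            yield row
```

The resulting tuples could then be passed in batches to the `executemany` method of a DB-API cursor for whichever MySQL driver is chosen. Note that the `seen` set keeps every unique row in memory, which may be a concern for the 152 MB file; a unique index on the table would be an alternative way to reject duplicates. Very long Body cells may also require raising the limit with `csv.field_size_limit`.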
I am planning to open a new VPS account and upload the files to that server so the coder can work right from the host if needed.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
.csv
MySQL