Data Processing/Scraping from Standard Format txt Files

En curso Publicado Oct 8, 2013 Pagado a la entrega
En curso Pagado a la entrega

Hi, we are looking to hire someone to manipulate already existing data files (will be given web link) that are in a standard .txt file format with numeric and text entries to a format used for computing.

1) We would like you to start with taking 100 of the entries (randomly selected with random number generator) in one of the 30 files we will give you.

2) We would like you to transform these 100 entries into a matrix in .csv form based on pre-specified categories given by us. Two of the columns are word and word count. Another is entry ID.

3) We also would like a sparse representation of the two columns of word and word count where there is a new matrix (rows are entry #, columns are word label - filled with the count) and that depends on size of file. We can talk about this.

4) The deliverable should be in manageable csv file sizes, which won't be a problem for this data...

But, we will definitely have more work if this is done successfully (over all files and more entries needed), so scalable routines are highly encouraged. Thinking about a million entries with a higher budget, if this goes well.

Thank you very much.

Please note that we will only hire someone who has the ability to do this automatically since we are looking for FUTURE work primarily. This is just a pilot.
Once we go from 100 entries to 1 million, manual typing will not work. We realize that file size will be an issue depending on the matrix, so if things eventually need to be broken apart into let's say 1000 files of 1000 entries, we will then use this with parallel computing routines for our computations. Thank you so much and we look forward to working with you.

Big Data Sales Entrada de datos Extracción de datos Procesamiento de datos Extracción de datos web

Nº del proyecto: #5006785

Sobre el proyecto

40 propuestas Proyecto remoto Activo Oct 9, 2013

40 freelancers están ofertando un promedio de $141 por este trabajo

jaylancer43

Hello - I am an expert techno-functional analyst having vast experience in lots of arenas of IT industry including Excel Macros. I am an Engineering Graduate with an MBA degree. If you see, I am among the niche bid Más

$111 USD en 3 días
(414 comentarios)
8.0
Toperfection

Dear "statsphd" Hope you are doing well. I have reviewed the project details and would like to offer our services. We have completed many Research/Data collection/Product add/Data mining assignments on [login to view URL] Más

$151 USD en 3 días
(168 comentarios)
7.8
uumairkhalid

Hi.. Expert web scraper/Data Minor here. Interested in your project. I assure you 100% accurate and good quality work. Regards

$105 USD en 3 días
(189 comentarios)
7.1
tjawad17

Hello Sir, We are a professional company specialized in Data Mining and Web Scraping. We have our own server, team and tools for data mining and scraping efficiently and accurately. We can parse your given text Más

$155 USD en 4 días
(165 comentarios)
6.9
happy2helpp

Respected sir, We saw project description and got complete idea about project. We are expert in Big Data, Data Entry, Data Mining, Data Processing and Web Scraping!!! We have worked on many similar tasks before and Más

$231 USD en 4 días
(84 comentarios)
6.9
diamond247

Hello Sir, We are a big set up company with excellent skilled operator who have a lot of experience in this segment, our employee complete more than 300 similar job, i have gone through your project specification, i Más

$144 USD en 3 días
(243 comentarios)
7.1
ashok7925

Hi, I am much interested in this work. Please share me more details with sample text file and describe me what would like to do. I can automate all of the process once I get understood your requirement. Please sha Más

$100 USD en 3 días
(33 comentarios)
5.3
elMancha

Hello there. I have high Excel and Visual Basic skills with great professionalism. I study electronics and computer engineering at Oporto university and I'm looking for work to fill the blanks on my schedule. I' Más

$60 USD en 3 días
(40 comentarios)
5.0
arvt

Hi I'm interested and I like to know more details about your project to bid accordingly. I have experience doing programs and scripts in some projects here and in other freelancer site. I have Skype, Gtalk, MS Más

$35 USD en 3 días
(12 comentarios)
4.9
mohanlg

Hi, I am interested to do these project work. Expert in data conversion work. Please send me more details of work to start. Thanks sunny

$35 USD en 2 días
(25 comentarios)
4.3
RajakScripts

Hi, Please attach the .txt file AND a matrix in .csv form based on your given pre-specified categories for a review, so I can adjust my bid & delivery time precisely. Yes, I aware that you want this to be perform Más

$88 USD en 3 días
(7 comentarios)
4.3
gokhanonal

Dear Sir / Madam, I'm a computer engineer (with BS Degree), working freelance in Istanbul, Turkey. I can complete your project as fast & accurate. Please let me know. Looking forward to hearing from you soon, Más

$35 USD en 1 día
(13 comentarios)
3.6
signo

Hello, I am experienced in working with large files and back-end processing in general. I will definitely finish this project in the next 24 hours. I still need some clarifications before getting started, regardi Más

$133 USD en 1 día
(32 comentarios)
4.2
thanhhungqb

Dear sir, I have read your requirement carefully and interested in it. I am expert on data entry, data scrapping and process data. I usually to do it automatic. For your project, I think I can automatic by a prog Más

$126 USD en 3 días
(15 comentarios)
3.6
sunil440

Good day! I would like to submit my application as Data Collector. I shall be pleased to consider me as a qualified applicant.I believe my qualifications would make me an outstanding asset to your organization. I woul Más

$100 USD en 3 días
(16 comentarios)
3.4
GurpreetSngh220

Hi, I am very much interested in your project. I would like to discuss with you more regarding the project. You can rely on me because i am serious on my work and not sitting here to waste time (both of us). you Más

$188 USD en 5 días
(7 comentarios)
3.2
FernandoCanizo

Hello, I'm interested, I'd to give it a try. Can you provide a sample file so I can send you my attempt? No compromises. Also send me any other information I should need to build a proper processing script, I'm t Más

$30 USD en 2 días
(2 comentarios)
3.4
inoussakabore

Hi i have almready do this kind of job. You can see that in my profile. I am ready to start it. I can do that in about one week.

$250 USD en 7 días
(3 comentarios)
3.3
igors233

Greetings, I'm professional software developer with 15+ years of experience in similiar tasks. I will produce a standalone exe (no dependecies) that will take as input given txt file (it could be downloaded automatical Más

$147 USD en 10 días
(4 comentarios)
3.5
szymszteinsl

Hi! I am professional C/C++/C#/Java programmer. I can do this project with highest quality, Best Regards, Szymszteinsl

$144 USD en 3 días
(2 comentarios)
3.3