Completado

Adding (inverted index) algorithm to my WebCrawler written Java project and modifying some methods

I have a WebCrawler project files written in Java. My project requirements are listed below:

Please read before you apply:

*********************************

1- I have a method in my project that calculate URLs waiting time inside frontier (I am using Priority Queue as frontier to store parsed URLs inside it). I am using (Maximum Waiting Time) for every URL, when a URL is reached this Max time it will dropping. I Need to calculate the position of dropped URL inside PriorityQueue.

2- I need to add some methods that stores (downloads) the crawled URLs (WebPages) on local disk.

3- Writing some class in (Java) to apply (Inverted-Indexing) on stored (downloaded) WebPages.

NOTE: You will maybe need to add an extra classes (or import them to project) to clean HTML files (like: stemming, stopwords etc.) before applying (Inverted-Index) algorithm on downloaded WebPages.

4- You will store the indexed WebPages on database. I prefer using MySQL to do it.

5- Need a simple GUI for making search and show the results on screen for end-user.

NOTE:

*********

- Please DO NOT apply if you don't have idea about indexing or you did not worked on WebCrawler or indexing WebPages before.

- The project in written in Java so any advice to change the project programming language will not be acceptable.

- Thank you for you interesting

Regards :)

- I will need fully commented code.

Habilidades: Extracción de datos, J2EE, Java, MySQL, Extracción de datos web

Ver más: writing a programming language, the java programming language, simple search algorithm, search algorithm in c, queue algorithm in c, programming methods, priority queue algorithm in c, priority algorithm in c, making an algorithm, java programming classes, java indexing, java database programming, index search algorithm, gui programming java, clean programming language, a search algorithm, apply algorithm, algorithm for search, algorithm for priority queue in c, genetic algorithm project report code java

Información del empleador:
( 7 comentarios ) Ankara, Turkey

Nº del proyecto: #9046803

Adjudicado a:

khannanav

We have a team of expert JAVA developers and DB Architects with more than 10 years of rich industry experience & would be more than happy to work on the project. We have delivered a web crawler project last week on Más

$231 USD en 5 días
(9 comentarios)
5.4

7 freelancers están ofertando el promedio de $250 para este trabajo

mantislin

Hi sir, I am scraping expert, I have did too many similar projects, please check my feedback then you will know. Can you tell me more details? then I will provide demo data for you. Thanks, Kimi

$250 USD en 5 días
(244 comentarios)
7.3
wangbeizou

A proposal has not yet been provided

$257 USD en 5 días
(93 comentarios)
5.9
tumakha

I am Senior Java Developer with more than 10 years of experience in Java design and development with strong problem solving skills. [login to view URL]

$500 USD en 10 días
(15 comentarios)
5.3
anuragiitk

I am an IITK graduate, 9 year experienced software professional and I have got top notch developers in my team, who have got experience across a span of technologies. The members in my team have worked with top notch t Más

$155 USD en 1 día
(21 comentarios)
5.4
owenzhu

master degree in computer, java professional, solid background in information retrieval and data mining

$222 USD en 7 días
(5 comentarios)
3.1
jacobuppalapati

Having 9 years of experience in java, j2ee technologies. Expertise in frameworks like spring(DI,security,MVC,AOP,boot etc),JSF(PrimeFaces,RichFaces), STRUTS Expertise in htmtl5,CSS3,javascript,jQuery,JQuery Mobile,A Más

$133 USD en 7 días
(0 comentarios)
0.0