Click here to Skip to main content
15,887,683 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
am working on a project at school doing a search engine , my part is to code an indexer i wanter know how can i change pages that the crawler has crawled to html , on my search engine we will corvert all document to html , corvert pdf file , word , and so on to html , i dnt know where to begin
Posted
Comments
TRK3 14-Jun-12 13:44pm    
That seems a bit beyond the scope of a school project.

Do you really need to actually convert them to HTML?

Or is it sufficient to extract the words from them so you can index them?
Nokukhanya01 14-Jun-12 14:59pm    
I realy need to convert them to html , not extract words that how we design it on our design phase

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900