My goal is to compare two resumes and flag them as duplicates when a given set of fields match.

As you may know, resume styles differ. How do I recognize that a name field is a name field, so that I can store it and compare it with the same field in another resume?
So far I have used the Interop method to read the whole document content into a string. I split that string on \t, \r and empty spaces and get the pieces in an array. From that array, how do I produce my own standard XML format like the one below?
<CANDIDATE_FULL_NAME>CandidateName here</CANDIDATE_FULL_NAME>
<CANDIDATE_FIRST_NAME>CandidateFirstName here</CANDIDATE_FIRST_NAME>
<CANDIDATE_LAST_NAME>CandidateLastName here</CANDIDATE_LAST_NAME>
<PRIMARY_EMAIL_ID>name@gmail.com</PRIMARY_EMAIL_ID>
<PHONE_BASIC>+919720018454155</PHONE_BASIC>
<DOB>8/2/1987</DOB>
<STREET1></STREET1>
<STREET2></STREET2>
<CITY></CITY>
<REGION></REGION>
<COUNTRY></COUNTRY>
<PIN></PIN>
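
For the fields that follow a recognizable pattern (email, phone), one possible starting point is plain regular expressions over the extracted text. The sketch below is in Python rather than the Interop/.NET stack mentioned above, and the patterns, the `extract_fields` and `to_xml` helpers, and the `CANDIDATE` wrapper element are all assumptions made for illustration. Names and addresses have no fixed pattern and are not covered here.

```python
import re
import xml.etree.ElementTree as ET

# Deliberately loose patterns; real resumes will need tuning.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


def extract_fields(text):
    """Return the first email and phone number found in the raw resume text."""
    email = EMAIL_RE.search(text)
    phone = PHONE_RE.search(text)
    return {
        "PRIMARY_EMAIL_ID": email.group(0) if email else "",
        # Keep only digits and a leading '+' so formatting differences vanish.
        "PHONE_BASIC": re.sub(r"[^\d+]", "", phone.group(0)) if phone else "",
    }


def to_xml(fields):
    """Serialize the field dictionary under a hypothetical CANDIDATE root."""
    root = ET.Element("CANDIDATE")
    for tag, value in fields.items():
        ET.SubElement(root, tag).text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = "John Doe\nEmail: name@gmail.com\tPhone: +91 97200 18454\r\nDOB"
    print(to_xml(extract_fields(sample)))
```

Searching the whole text, instead of the split array, keeps phone numbers that contain spaces in one piece.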



I googled and found the following link:
http://recruitplusapidemo.onlineresumeparser.com/Default.aspx
Updated 29-Apr-13 3:58am
Comments
Pheonyx 29-Apr-13 7:31am    
How are you handling different Word documents? Some might be .doc and others .docx. For .docx files you could look at the Open XML approach, where you extract the XML part of the document.

As for actually parsing the data to find what you are looking for, I'm not sure.
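
A minimal sketch of the Open XML idea for .docx files, assuming Python and only its standard library: a .docx file is a ZIP archive whose main body lives in `word/document.xml`. The helper name `docx_paragraphs` is hypothetical. Legacy .doc files are a different binary format and are not handled by this approach.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the main document part.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_paragraphs(source):
    """Return the non-empty paragraphs of a .docx file as plain strings.

    `source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph is split into runs; join the text of every <w:t> node.
        text = "".join(node.text or "" for node in para.iter(W_NS + "t"))
        if text.strip():
            paragraphs.append(text)
    return paragraphs
```

Working paragraph by paragraph keeps some of the layout that is lost when the whole document is flattened into one string.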
Manfred Rudolf Bihy 29-Apr-13 7:46am    
"unable to proceed further": This lets us assume that you've made any progress at all. Please be so kind as to tell us what you have achieved so far and where exactly you are stuck now.
"Kindly suggest me the same": This makes no sense whatsoever in the context of your question.
"My document is not a specific order": Well if it is one document, why go through the trouble of reading it in an automated fashion?
" using Natural Language processing": Try it!
Richard MacCutchan 29-Apr-13 11:35am    
Please do not post the same question in multiple forums.

You should go for a third-party tool or API that provides a resume parsing service. It takes a resume file as input and returns the parsed resume as XML.

I used www.rchilli.com/
 
Ganesh,
once you have the data points of the resumes after parsing, you can run your matching algorithm on them.
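
As a rough illustration of such a matching step, the Python sketch below compares two parsed resumes field by field after normalizing them. The field names are taken from the XML template in the question; the normalization rules, the "last 10 digits" phone comparison and the `min_matches` threshold are assumptions, not a rule from any particular parser.

```python
import re


def normalize_name(value):
    # Lowercase and collapse runs of whitespace.
    return " ".join(value.lower().split())


def normalize_email(value):
    return value.strip().lower()


def normalize_phone(value):
    # Assumption: compare only the last 10 digits to ignore country prefixes.
    return re.sub(r"\D", "", value)[-10:]


NORMALIZERS = {
    "CANDIDATE_FULL_NAME": normalize_name,
    "PRIMARY_EMAIL_ID": normalize_email,
    "PHONE_BASIC": normalize_phone,
}


def is_duplicate(resume_a, resume_b, min_matches=2):
    """Treat two parsed resumes as duplicates when enough fields agree."""
    matches = 0
    for field, normalize in NORMALIZERS.items():
        left = normalize(resume_a.get(field, ""))
        right = normalize(resume_b.get(field, ""))
        # Empty fields never count as a match.
        if left and left == right:
            matches += 1
    return matches >= min_matches
```

Requiring more than one matching field reduces false positives from common names.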

I can help if you post your exact question / workflow.

We are using RChilli, and we found their team to be the best at resume parsing and at responding to our queries.
- S
 
Ganesh, I have been in this industry for the last 12 years. The worst part is that every resume is written in a different format, and these formats keep changing with each new batch coming in.

I have seen only a few companies in the world that do this. At my last jobs with Infosys and Wipro, even they were using a third-party tool for resume parsing.
I recommend www.rchilli.com; we used it on our corporate website and it works perfectly.
-S
 
Machine learning, dot products, Gaussian and Bayesian algorithms: ring any bell? :) I can understand if they don't. But if you don't want to go with third-party compiled apps or APIs, I would suggest implementing your own classes to identify the patterns in incoming resumes, then measuring and learning from the differences captured.
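
To make the dot product idea concrete, here is one possible reading of it in Python: treat each resume as a bag-of-words vector and score two resumes by cosine similarity, which is the dot product divided by the vector lengths. The tokenizer and any duplicate threshold are assumptions; this is a sketch of one simple similarity measure, not a trained model.

```python
import math
import re
from collections import Counter


def term_counts(text):
    """Count lowercase alphanumeric tokens in the resume text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(text_a, text_b):
    """Return a score from 0.0 (no shared terms) to 1.0 (same term profile)."""
    vec_a, vec_b = term_counts(text_a), term_counts(text_b)
    # Dot product over the terms that appear in both resumes.
    dot = sum(vec_a[t] * vec_b[t] for t in vec_a.keys() & vec_b.keys())
    norm = math.sqrt(sum(c * c for c in vec_a.values())) * math.sqrt(
        sum(c * c for c in vec_b.values())
    )
    return dot / norm if norm else 0.0
```

Because word order is ignored, two resumes with the same content in a different layout still score close to 1.0.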

Cheers,

VC
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


