Thanks and Sorry to say but it is not am looking for.
Hope you get that ,I want something resume parser.
I just want to extract email id and phone number.
If I say I wish to develop a resume parser,
I would go with below preocess:
1] Know file format.
2] Convert every file into one specific format.
3] Extract email and and mobile number with regular expressions from that file.
Hows this idea?
But need one help..
is there any (specific) format, which I was talking about ..
which will help me to extract things easily?
Like pdf to txt, doc to .txt, docx to txt
and extract from .txt file
Can anyone help me with this idea?
or any other idea?
There are many tools for converting files into text, but parsing them is something else entirely. Not one resume is identical, so you have to account for all variations of phone numbers and emails. If there are more than one, you'll need to store them all, or develop a way to find out if it belongs to the uploader.