954,529 Members — Technology Publication meets Social Media
Username:
Password:
Lost login information?
Have something to say? Contribute New Article Reply to this Article

read no of occurence of word in a pdf using c#

Dear all,

im new to this forum.can u pls help me to do this one.
i am reading a particular pdf file and searching for a particular word and how time it occured means number of occurences in that file..
pls assist me to do this one

Thanx in advance..
Ram Krishna

ramsaryanmca
Newbie Poster
1 post since Oct 2006
Reputation Points: 10
Solved Threads: 0
 
Dear all, im new to this forum.can u pls help me to do this one. i am reading a particular pdf file and searching for a particular word and how time it occured means number of occurences in that file.. pls assist me to do this one Thanx in advance.. Ram Krishna




Do you already know how to read a pdf file? how the format, and so on?

regards,
Lok

lok_tan
Newbie Poster
8 posts since Nov 2006
Reputation Points: 10
Solved Threads: 0
 

I'd imagine you'd have to use some external pdf library which integrates with c# to convert the file into plain text.

Once that's done, you would just write a program as normal.

iamthwee
Posting Expert
5,950 posts since Aug 2005
Reputation Points: 1,543
Solved Threads: 439
 

The PDF format is exceedingly complex, and Adobe's developer tools are buggy and virtually unsupported. I would look at a 3rd party tool set such as those provided by PDF Tools AG . They are COM objects, so you'll need to create a COM wrapper.

I've also had partial success automating Acrobat's "search" function. However, Acrobat is not .NET compatible.

tgreer
Made Her Cry
Team Colleague
2,118 posts since Dec 2004
Reputation Points: 227
Solved Threads: 37
 

Yes, I agree with iamthwee and tgreer. That's why I'm asking you whether you already know how to read the pdf file or not. Since it's a big obstacle to solve first, before goes into your main destination :)

regards,

Lok

lok_tan
Newbie Poster
8 posts since Nov 2006
Reputation Points: 10
Solved Threads: 0
 

This article has been dead for over three months

Post: Markdown Syntax: Formatting Help
You