Member Avatar for kohkohkoh

I tried to search over this site and others as well.
But couldnt help me to solve the above matter.
Actually i want to do a searching algorithm in a PDF file.
I tried to use:

[EMAIL="%@Import"]Import[/EMAIL] Namespace="System.IO"%>

Sub page_load(ByVal Sender As Object, ByVal e As EventArgs)
Dim filetoread As String
filetoread = "C:\BuzznetAds30Mac07.pdf"
Dim filestream As StreamReader
filestream = File.OpenText(filetoread)
Dim readcontents As String
readcontents = filestream.ReadToEnd()
MsgBox(readcontents)
filestream.Close()
End Sub

but it gave me "dont know what characters".
I studied that the API for pdf is different. In order able to read the pdf, i required 3rd party's component to do so..

I would like to know, is there any other alternative? Vb 6.0, Vb.net, C# are welcomed.

Thank you.
Hope that you guys out there can help me...strrrrrugle for 2 weeks researching for this...

Recommended Answers

All 3 Replies

Member Avatar for iamthwee

PDf to text will probably involve some sort of OCR tool.

pdf is owned by Adobe but they have opened it for developers.But it is complicated and for simple needs it would seem third party libraries are better solution than developing a solution yourself.

Text Mining Tool can help you.
It is the free PDF, DOC, CHM, RTF, HTML converter. It performs analysis of pdf, doc, rtf, chm, html files and converts them to text.

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.