How to find the most frequent words in 4 GB txt?

Question

Konstantinos_1 0 Newbie Poster

11 Years Ago

Ι have many text files with total space 4 GB. In Russian, Greek and English languages.

Is there a way - program - software to find the most common - frequent words in these files?

I want it to produce a list ordered from most to least used words.

I know only C and Matlab. Thanks in advance.

windows-vista-7-8

3 Contributors
2 Replies
167 Views
17 Hours Discussion Span
Latest Post 11 Years Ago Latest Post by KushMishra

All 2 Replies

meta.quota 14 Newbie Poster

11 Years Ago

to reduce code lines, i'd suggest You use C++ <map> .. it's much more easier .. you just need a helper function named, maybe (split) which is a vector of string that takes as an argument a const string reference ...

get the sample code here

You just have to add your file handlers so instead of the above program getting input from the keyboard, it gets it from the specified file ...

******* HAVE PHUN C0DiNG *******

Ancient Dragon commented: Good advice +14

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

KushMishra 38 Senior Technical Lead · Answer 1 · 2013-12-23T06:24:30+00:00

Hi, I am not sure but probably a software called "Crawler" may fix your issue.
This crawler is used by many websites also like Google etc.
May be it would be of some help to you as well.

How to find the most frequent words in 4 GB txt?

Recommended Answers Collapse Answers

All 2 Replies

Recommended Answers