cy163 0 Newbie Poster

Hello ALL,

I have a table contains words extracted from documents which are belonging to a number of categories. Each category contains a number of documents. For each category, I want to find out those words which occur at least in N documents belonging to this category.

Category    DocID       Word       Freq      Tag
Art             A12         Paint        3           verb
Art             A12        Exhibition    5           noun
Art             A12         Show        2           verb
Art             A12          Ticket       4          noun

Art             B34         Exhibition   2          noun
Art             B34         Musum       3          noun
Art             B34         Contemporary   2      adj
Art             B34         Show               1      noun
Art             B34         Visitor             2       noun


Art             E97         gallery             3      noun
Art             E97         Show               3      noun
Art             E97         Exhibition         2       noun


Electronics    A32        Circuit           2       noun
Electronics    A32        IC                 2       noun
Electronics    A32        Factory         2       noun
Electronics    A32       Electronics      3       noun

Electronics    F11        HiFi               2     noun
Electronics    F11        IC                 2     noun
Electronics    F11        Circuit           2     noun

Electronics   C55         IC                 3      noun
Electronics   C55         Circuit           3      noun
Electronics   C55         Transmit        2      verb
Electronics   C55         Receive         5      verb

I have no idea on how to do this query.

Thanks

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.