User: editor -- 2008-12-28
Hits: 4182
Type: Count and statistics
Description:
How can you count the frequency of different words on a website such as cnn.com?
Answer:
Hint: You need to download and install "Replace Pioneer" on Windows to complete the following steps.
1. Open Replace Pioneer (2.3 and above) and choose the menu "Tools->Pattern Counter".
2. Set the source to "file/http" and enter the web address: http://www.cnn.com
3. Set the "Counter Template" to "Characters, Words, Lines".
4. Click the "Count" button. The detailed numbers will be shown like this:

    Chars(word)      5668
    Chars(non-word)  2630
    Words            1196
    Lines            188

5. There are 1196 different words. Select the Words row and click "Detail"; the occurrences of the different words will be shown like this:

    Number  Words(\w+)
    ======= ====================================================
    24      to
    24      CNN
    22      the
    15      of
    14      in
    14      for
    13      Gaza
    12      a
    12      Video
    ...

This shows that "to" and "CNN" are the most frequently used words on cnn.com, each appearing 24 times.

For further instructions, please refer to the following address: http://www.mind-pioneer.com/replace/help/Pattern_Counter.html
Screenshot 1: Pattern_Counter_Window
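
For readers who want to cross-check the numbers outside Replace Pioneer, the same kind of count can be sketched in Python. This is only an illustration, not how Replace Pioneer works internally: the function names (fetch_text, summarize, top_words) are hypothetical, the page is assumed to decode as UTF-8, and a "word" is assumed to be any match of the pattern \w+ shown in the Detail header. The raw page source, markup included, is counted, so the figures need not match the tool's output.

```python
import re
from collections import Counter
from urllib.request import urlopen

# Assumption: a "word" is any match of \w+, as in the Detail header above.
WORD_PATTERN = re.compile(r"\w+")


def fetch_text(url):
    """Download a page and decode it (assumes UTF-8; bad bytes are replaced)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def summarize(text):
    """Return totals similar to the "Characters, Words, Lines" template."""
    words = WORD_PATTERN.findall(text)
    word_chars = sum(len(word) for word in words)
    return {
        "chars_word": word_chars,
        "chars_non_word": len(text) - word_chars,
        "words": len(words),                # total word matches
        "distinct_words": len(set(words)),  # case-sensitive distinct words
        "lines": len(text.splitlines()),
    }


def top_words(text, n=10):
    """Return the n most frequent words as (word, count) pairs."""
    return Counter(WORD_PATTERN.findall(text)).most_common(n)
```

Calling top_words(fetch_text("http://www.cnn.com"), 10) would list the ten most frequent words with their counts, and summarize(...) on the same text gives the character, word, and line totals. Matching is case-sensitive here, so "CNN" and "cnn" are counted separately.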