Replace Pioneer Home   All Examples   Free Download

 New request --free  RSS: Replace Pioneer Examples

1232.Text file parser -- How to find from an article all words not appeared in a word list?

User: Blair Yu -- 2014-08-16          << 1231  1233 >>
Hits: 1317
Type: Text file parser   
Search all Text file parser examples
Description:
How to find from an article all words not appeared in a word list?
I'am wanna easily find the words that i haven't learned when i open the TXT or WORD or PDF documents.
Input Sample:
myword.txt
word1 word3 word5

input.txt
word1 word2 word3 word4 word5
Output Sample:
output.txt
word2 word4
Answer:
Hint: You need to Download and install "Replace Pioneer" on windows platform to finish following steps.
First of all, prepare a file that contains all words you know, named d:\test\mywords.txt, then:
1. ctrl-o open the new txt article file
2. ctrl-h open 'replace' dialogue
* uncheck 'print unmatched unit' option.
* set 'search for pattern' to:

* set 'replace with pattern' to:

* set 'if=>' to:

* click 'advanced' tab, set 'run following at the beginning of replace' to:

3. click 'replace', all new words will be found
4. ctrl-s save to result to a new file, such as newwords.txt

Note: 
* in step 2, please use double backslash as folder delimiter such as d:\\test\\mywords.txt.
* the mywords.txt can be any english text file with any free format, for example: "This is for test, that is also for test." is equivalant to "This, is, for, test, that, also"

Screenshot 1:  Replace_Window


Screenshot 2:  Replace_Advanced_Window


Similar Examples:
How to find out in each line which words in the list appear? (56%)
How to find out all lines that appeared specified number of times? (56%)
How to capitalize all words except some stop words? (55%)
How to count number of words appeared in a text file? (55%)
How to delete all lines whose first word appeared in previous line? (54%)
How to count and sort the frequency of all words appeared in many text files? (54%)
How to find consecutive words with more than 4 characters after the word "a"? (53%)
How to filter out all web pages that does not contain specified words? (53%)

Check Demo of Text file parser
Keywords:
free  pdf  english  document  found  doc  backslash  double  slash  var  free format text file  parser for a english text file  free text  free replace text  text file of all english words  all english words text file  all words in english text file  all english words in text file