1383. Character encoding -- How to find files with a specified encoding and convert them to UTF-8?

User: ting -- 2016-11-15
Type: Character encoding   
Description:
How can I convert all files in a directory and its subdirectories to UTF-8? It is difficult to detect every file's encoding, because the files use various encodings, e.g. UTF-8, UTF-8 with BOM, Big5, GB2312, ANSI or ASCII (I do not actually know which).
I tested the example given on your site, but files that are already UTF-8 are converted to unreadable text, because some files are already UTF-8 or UTF-8 with BOM (I do not know which ones).

Is there a way to automatically exclude the UTF-8 and UTF-8 with BOM files (leave them unconverted) and then batch convert the rest?
Answer:
Hint: You need to download and install "Replace Pioneer" on Windows to complete the following steps.
In general, the exact encoding of a file cannot be determined with certainty. The exception is encodings such as UTF-8 with BOM, which can be detected from the BOM (byte order mark) at the beginning of the file.

So you must know both the input encoding and the output encoding.
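
Outside of Replace Pioneer, the BOM check described above can be sketched in a few lines of Python. This is only an illustration, not how Replace Pioneer works internally; the function names has_utf8_bom and is_valid_utf8 are hypothetical. The second function is a heuristic: bytes that decode as strict UTF-8 are very likely UTF-8 (or plain ASCII), but a file in another encoding could, in rare cases, also happen to pass.

```python
import codecs


def has_utf8_bom(data: bytes) -> bool:
    """Return True if the bytes start with the UTF-8 BOM (EF BB BF)."""
    return data.startswith(codecs.BOM_UTF8)


def is_valid_utf8(data: bytes) -> bool:
    """Heuristic: return True if the bytes decode as strict UTF-8.

    Pure ASCII also passes, since ASCII is a subset of UTF-8.
    """
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False
```

A script could read each file with open(path, "rb") and skip it when either check returns True, which roughly matches the request to leave UTF-8 files unconverted.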

One way to find all files with a specific encoding, such as Big5, and convert them to UTF-8:
1. Set the "settings->encoding" menu to 'big5'.
2. Open the "tool->batch runner" menu and click "search files". Set "search directory" to your root path and tick "search subdirectory".
3. Set "contain word/phrase" to a specific word that exists in all your files, and click "search".
4. Click "close". All files with the desired encoding will be listed.
5. Click "Encode Convert", set "input encoding" to "big5" and "output encoding" to "UTF8". Done.
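
For readers who want to see the idea behind these steps, here is a rough Python sketch of the same approach: decode each file with the assumed encoding, keep only the files in which a known word appears, then rewrite those files as UTF-8. It is an approximation under stated assumptions, not Replace Pioneer's implementation; find_files_with_word and convert_file are hypothetical names.

```python
from pathlib import Path


def find_files_with_word(root, word, encoding):
    """Return files under root (recursively) that decode cleanly
    with `encoding` and contain `word` in the decoded text."""
    matches = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        try:
            text = path.read_bytes().decode(encoding)
        except (UnicodeDecodeError, OSError):
            continue  # not decodable as `encoding`, or unreadable
        if word in text:
            matches.append(path)
    return matches


def convert_file(path, src_encoding, dst_encoding="utf-8"):
    """Rewrite one file in place from src_encoding to dst_encoding."""
    text = Path(path).read_bytes().decode(src_encoding)
    Path(path).write_bytes(text.encode(dst_encoding))
```

The search word should contain non-ASCII characters. An ASCII-only word has the same bytes in Big5, GB2312 and UTF-8, so it would not tell the encodings apart. Since convert_file overwrites files in place, it is safer to try it on a copy of the directory first.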

