Jump to content

Sky Slate Blueberry Blackcurrant Watermelon Strawberry Orange Banana Apple Emerald Chocolate

UniConv - Convert unicode [CMD]


  • Please log in to reply
9 replies to this topic
BoBo
  • Guests
  • Last active:
  • Joined: --

The 32-bit binary of Basis Technology's UNICONV utility, which converts between most East Asian code-sets and Unicode.

uniconv.inf (2633 bytes) Some brief explanation from the "help" printouts.
[uniconv.txt] (7037 bytes) The words from the "help" screen.
[uniconv.zip] (726772 bytes) The Windows NT/95/98 binary and DLL.
uniconv_old.exe (835072 bytes) The previous Windows version (no DLL)

The Sun (Solaris 2.5), HPUX and Macintosh binaries are available too.

Uniconv Help

------------------------------------------------------------------------

Uniconv is a command line utility that uses the Basis Technology C++
Library for Unicode for converting text between encodings and optionally
applying transforms to it.

Usage :
Uniconv will convert a text file written in a given encoding (click here
for accepted encodings) to another of its accepted encodings. It uses a
command line interface, the usage being as follows:

uniconv [-options]

[property | transform]*

uniconv
Name of the program to run.

input-encoding required
List the encoding of the input file. Encoding name must be
written in the way listed below.

input-file required
List the name of the file (if in the current directory) or the
path and file name of the file (if not in the current directory)
to be converted.

output-encoding required
List the desired encoding of the ouput file. Encoding name must
be written in the way listed below.

output-file required
List the name of the file to be created in the new encoding (if
in the current directory) or the path and file name of the new
file (if not in the current directory).

property optional
Returns true or false value for characters. A property is
associated with the transform that follows it. Properties not
followed by a transform are ignored. Multiple property-transform
pairs are OK. Multiple properties per transform are also OK. See
Character Properties for more information about how to use
properties, and see below for a quick reference of the
properties available.

transform optional
Changes a property value for designated characters in a file.
Multiple transforms are OK. See Transforms for more information
about how to use transforms, and see below for a quick reference
of the transforms available.

options:
Use these flags at the beginning of the command line, before you
specify the input and output encodings and filenames.
-debug optional
This option will print messages generated by Auto-detect. For
example, if you are converting a Japanese file and the input
encoding is japaneseautodetect, uniconv will list the encodings
it is attempting (sjis, euc-j, etc.) and the results.

-help optional
Displays the copyright information.

-subst optional
Allows you to change the default substitution character. The
substitution character is the character that is used if there is
no direct mapping between characters in a conversion. The
default substitution character is CTRL-Z.

Notes
- All command line arguments are case insensitive.
- Separate properties and transforms with a space.
- If there are multiple properties or transforms, they will be
performed in the order listed.
- The options -debug, -help, -subst, if used, must directly
follow "uniconv".
- * means more than one property or transform is OK.

Encodings :
Quick Reference: Accepted Encodings
Arabic, ASCII, Big5, BMP, ChineseAutoDetect, cp1251, cp1252, cp437, cp850,
EUC-J, EUC-KR, GB2312, Greek, Hebrew, HZ, ISO-2022-JP, ISO-2022-KR,
ISOLatinCyrillic, JapaneseAutoDetect, JIS_X0201, JIS_X_0208,
KoreanAutoDetect, Latin1, Latin2, Latin3, Latin4, Latin5, Latin6,
Shift-JIS, Thai, UCS2, Unicode11UCS2, Unicode11UTF7, Unicode11UTF8, UTF7,
UTF8

Accepted Properties:
UppercaseLetter, LowercaseLetter, TitlecaseLetter, ModifierLetter,
OtherLetter, AnyLetter, NonSpacingMark, CombiningMark, DecimalNumber,
OtherNumber, DashPunctuation, OpenPunctuation, ClosePunctuation,
OtherPunctuation, MathSymbol, CurrencySymbol, OtherSymbol, SpaceSeparator,
LineSeparator, ParagraphSeparator, ControlCharacter, OtherCharacter,
UndefinedScript, GeneralScript, Latin, Greek, Cyrillic, Armenian, Hebrew,
Arabic, Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu,
Kannada, Malayalam, Thai, Lao, Tibetan, Georgian, HangulJamo, Hiragana,
Katakana, Kana, Bopomofo, CJKUnifiedIdeographs, Hangul, UndefinedWidth,
Fullwidth, Halfwidth

Accepted Transforms :
ToLowercase, ToUppercase, ToFullwidth, ToHalfwidth, ToHiragana,
ToKatakana, Decompose, Compose, ToCombiningMark, ToSpacingMark, Select,
Filter, ToCRLF, ToCR, ToLF, ToParagraphSeparator, ToLineSeparator,
ToCanonical, ToTraditionalChinese, ToSimplifiedChinese, RomajiToHiragana,
RomajiToKatakana, KanaToRomaji, ToLatinNumber, SGMLEntity



bnbn2000
  • Members
  • 6 posts
  • Last active: Dec 13 2012 07:15 AM
  • Joined: 10 Jul 2006
another similiar tool:

<!-- m -->http://www.gbordier....ngconverter.htm<!-- m -->

heresy
  • Members
  • 291 posts
  • Last active: Sep 26 2008 10:47 PM
  • Joined: 11 Mar 2008
i didn't tested it yet.
but sounds sweet
it will be placed top of my list until ahk supports unicode
thanks BoBo and bnbn2000 too

Penky
  • Guests
  • Last active:
  • Joined: --
Bobo,

after days of searching, UNICONV is just exactly what I was looking for!!!!! I needed a tool to batch convert UTF-8 txt files (spooled from an oracle database) to Unicode.

THANKS SO MUCH ITS WORKING PERFECT!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

BoBo³
  • Guests
  • Last active:
  • Joined: --
You're welcome 8)

Convert .ahk to .exe
  • Guests
  • Last active:
  • Joined: --
it doesant work for me when i want to convert my script from .ahk to .exe using your program it just open and exit program right after start so i cant inser any commands
maybe its because iam using Windows 7 :?:

AK
  • Guests
  • Last active:
  • Joined: --
Really useful program. Exactly what I needed. Worked nice and it's fast. Converted 2g file in under a minute. BTW worked fine on Win7 for me.
Thanks!

  • Guests
  • Last active:
  • Joined: --
uniconv -out C:\HOME\MAK1.PRN -in C:\home\PP.PRN -decode big-5 -encode utf-8

can you help me how to convert thailand text file to postscript file

  • Guests
  • Last active:
  • Joined: --
Google for Ghostscript and install it, then you should be able to create postscript files.

Billy
  • Guests
  • Last active:
  • Joined: --
Hi, is there a way (or can you please add it as an option) of a batch conversion?
Say if I want to covert all the files in the given directory?
Thanks!