UNICODE Converter
Unicode (unique, unified, universal encoding) is a character encoding standard used for the representation of
text for computer processing.
All symbols, numbers, characters, mathematician notation from all languages are assigned a code
for example: U+0041 for English letter A. There are different coding schemes eg ASCII which uses 1B where
UNICODE gets up to 4B for each character. They store letters and other characters by assigning a number
for each one. Before Unicode was invented, there were hundreds of different encoding systems for assigning
these numbers. No single encoding could contain enough characters. If you have not istalled the stylish urdu font then you may
download it by using
Download Urdu Font (Jameel Noori Nastaleeq Regular)
Unicode Converter.
Convert to Unicode
What is character encoding?
Computer stores text and numbers in the form of code and then transfer to text and numbers again.
The Unicode standard defines such a code by using different character encoding scheme form delivery
of same information.
All kinds of character encoding schem do the same for assigning a number to a character. eg:
ASCII Character Table
Letter |
ASCII-CODE |
Letter |
ASCII-CODE |
A |
065 |
a |
097 |
B |
066 |
b |
098 |
C |
067 |
c |
099 |
D |
068 |
d |
100 |
E |
069 |
e |
101 |
F |
070 |
f |
102 |
G |
071 |
g |
103 |
H |
072 |
h |
104 |
I |
073 |
i |
105 |
J |
074 |
j |
106 |
K |
075 |
k |
107 |
L |
076 |
l |
108 |
M |
077 |
m |
109 |
N |
078 |
n |
110 |
O |
079 |
o |
111 |
P |
080 |
p |
112 |
Q |
081 |
q |
113 |
R |
082 |
r |
114 |
S |
083 |
s |
115 |
T |
084 |
t |
116 |
U |
085 |
u |
117 |
V |
086 |
v |
118 |
W |
087 |
w |
119 |
X |
088 |
x |
120 |
Y |
089 |
y |
121 |
Z |
090 |
z |
122 |
what is ascii and unicode
UNICODE
Unicode Converter converts characters in UTF-16, UTF-8, and UTF-32 formats to
their Unicode and decimal representations. In addition, you can percent encode/decode URL parameters.
As you type in one of the text boxes above, the other boxe is used to hold converted symbols into equivalent code which is readable
for Urdu Inpage software.
ASCII
American Standard Code for Information Interchange is the first wide-range encoding scheme. It supports
upto 128 charcaters defines mostly English and Numeric common characters but bit liitle limited for the world.
So the Unified encoding standards were established for which is acceptable for all computers to transfer information without
any confussion. Unicode standard defines over 128,000 characters and can be seen at the Unicode Consortium.
It has several character encoding forms UTF-Unicode Transformation Unit:
-
UTF-8: Encode English characters widely used in email systems and on the internet. It uses 1 Byte (8 bits).
-
UTF-16: Encode the most commonly used characters with additional characters by a pair of 16-bit numbers.
-
UTF-32: UTF-32 is capable of representing every Unicode character as one number consuming 4 Bytes (32 bits).