Question

You have been given the following code which represents some Unicode characters in UTF-8 encoding. Explain...

You have been given the following code which represents some Unicode characters
in UTF-8 encoding. Explain briefly how you detected the characters (note you are expected to
demonstrate this in binary/octal). Express the message contained as Unicode code points.
f0 9d 99 b0 ca a8 e2 9e 93 31

0 0
Add a comment Improve this question Transcribed image text
Answer #1

SOL)

Hey, beneath is the appropriate response portraying how to identify characters from the given UTF-8 encoding. Here the encoding is in Hexadecimal structure. If there should arise an occurrence of any inquiry please remark and furthermore please rate the appropriate response too. Much appreciated

Be that as it may, before begins the disentangling procedure let us initially comprehend what is Unicode and UTF-8

· A character in UTF8 can be from 1 to 4 bytes in length. UTF-8 can speak to any character in the Unicode standard. UTF-8 is in reverse good with ASCII. UTF-8 is the favored encoding for email and pages.

· Unicode is a character set. UTF-8 is encoding.

· Unicode is a rundown of characters with special decimal numbers (code focuses). A = 65, B = 66, C = 67, ....

· This rundown of decimal numbers speaks to the string "hi": 104 101 108 111

· Encoding is the means by which these numbers are made an interpretation of into twofold numbers to be put away in a PC:

· UTF-8 encoding will store "hi" like this (parallel): 01101000 01100101 01101100 01101111

· Encoding makes an interpretation of numbers into paired. Character sets makes an interpretation of characters to numbers.

Presently the inquiry is express the message contained in given Unicode codepoints with clarification.

F0 9d 99 b0 ca a8 e2 9e 93 31

Let us draw the table which contains the decoded character sets of above Unicode.

For instance The Unicode character of UTF-8 encoding f0 is ð (ETH).

Unicode character

Character name

Hex code point

Decimal code point

Hex UTF-8 bytes

Octal UTF-8 bytes

F0

ð

LATIN SMALL LETTER ETH

00F0

240

C3 B0

303 260

9D

OPERATING SYSTEM COMMAND

009D

157

C2 9D

302 235

99

Control

Function

0099

153

C2 99

302 231

B0

°


DEGREE SIGN

00B0

176

C2 B0

302 260

CA

Ê

LATIN CAPITAL LETTER E WITH CIRCUMFLEX

00CA

202

C3 8A

303 212

A8

¨

DIAERESIS

00A8

168

C2 A8

302 250

E2

â

LATIN SMALL LETTER A WITH CIRCUMFLEX

00E2

226

C3 A2

303 242

9E

ž

PRIVACY MESSAGE

009E

158

C2 9E

302 236

93

SET TRANSMIT STATE

0093

147

C2 93

302 223

31

1

DIGIT ONE

0031

49

31

61

Add a comment
Know the answer?
Add Answer to:
You have been given the following code which represents some Unicode characters in UTF-8 encoding. Explain...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT