Understanding Character Encoding

To a computer, text characters are symbols. These symbols are assigned numbers (integers) in order to store these symbols in memory. Encoding is the system by which these numbers are assigned. Many different encoding methods arose to handle special characters and various languages. Issues can arise when translating or editing files with a different encoding from the one they were created with. Encoding methods are not always compatible and interchangeable.

Issues between encoding systems were the inspiration for the Unicode format. The goal of Unicode is to provide a unique number for each character, regardless of language, platform, or program. It does this by assigning letters to code points like U+#### where #### is a hexadecimal number. Within Unicode there are different methods (formats) for storing these unique numbers. A discussion of those various methods is outside of the scope of this article, but to keep things simple, UTF-8 is an efficient way to store the Unicode format and is considered the best practice for encoding.

Software applications used for creating websites may save with any of the various character encodings. It can be helpful to know where to check or how to change your file's character encoding in your software. This is an important enough issue that the W3C (the organization that develops Web standards) has information about setting character encoding in the most popular web design software:

Setting encoding in web authoring applications

Character encoding becomes especially important when you are editing a file. If you try to open a file with a different encoding than what it was created with, issues can occur in the display of characters. In our next article, we'll discuss the character encoding check in the cPanel File Manager and what to look for if you plan to edit a file in the cPanel File Manager.

Did you find this article helpful?

We value your feedback!

Why was this article not helpful? (Check all that apply)

The article is too difficult or too technical to follow.
There is a step or detail missing from the instructions.
The information is incorrect or out-of-date.
It does not resolve the question/problem I have.

How did you find this article?

Please tell us how we can improve this article:

Email Address

Name

new! - Enter your name and email address above and we will post your feedback in the comments on this page!

Did you find this article helpful?

« Prev

Character Encoding and the cPanel File Manager

Domain Masking

Name:
Email Address:
Phone Number:
Comment:

Submit	Please note: Your name and comment will be displayed, but we will not show your email address.

News / Announcements

Help Center Login

Don't want to login using Facebook or Google+?

Ticket:	Submit a Support Ticket
Chat:	Click To Chat Now

Knowledge Base

Community Q&A

Learning Corner

How To

Understanding Character Encoding

Post a Comment

News / Announcements

Help Center Login

Related Questions

Help Center Search

Current Customers

Ask the Community

Not a Customer?