How to use the UNICHAR function
What is the UNICHAR function?
The UNICHAR function in Excel returns the Unicode character corresponding to a given Unicode code point number.
Table of Contents
1. Introduction
What is UNICODE?
Unicode provides a universal character set and encoding schemes to represent text consistently across computing technologies and interfaces. It supports internationalization and localization needs which enables global websites and apps to display text and accept input in any language script like Arabic, Chinese, Hindi etc. without restrictions.
- Provides a unique number for every character, regardless of the platform, device, application, or language.
- With the capacity to represent over 1.1 million characters, Unicode encompasses all the world's languages and writing systems in one unified encoding standard.
- Allows data exchange and processing of text in any language combination.
- Maintained by the Unicode Consortium and coordinated with ISO standards.
- Common Unicode encodings are UTF-8, UTF-16, and UTF-32, which provide variable length or fixed width representations.
What is a character set?
A character set is a defined collection of textual characters used to represent writing systems, symbols, and numbers. They provide a mapping between numeric codes and human-readable characters. Common character sets include ASCII, Unicode, UTF-8, Latin-1, etc.
What is the difference between the UNICHAR function and the CHAR function?
UNICHAR takes a Unicode code point as input and returns the Unicode character for that code point. CHAR takes an ANSI code value as input and returns the ANSI character (windows).
What is the difference between UNICODE and ANSI?
ANSI uses 8-bit encoding, unicode uses variable width encoding of 8-bit to 32-bit. ANSI is limited to English, unicode enables internationalization and localization. ANSI predates Unicode and is a legacy standard, Unicode is a newer standard.
The UNICHAR and the UNICODE functions let you convert between Unicode characters and their underlying code point representations.
2. Syntax
UNICHAR(number)
number | Required. The Unicode number for which you want the corresponding character. |
3. Example
UNICHAR function converts a Unicode code point value to its corresponding Unicode character.
Formula in cell C3:
This results in #VALUE! because 0 is not a valid Unicode code point.
Formula in cell C4:
The UNICHAR function accepts a numeric Unicode code point which is 65 in this example. For code point 65, the corresponding Unicode character is 'A'. UNICHAR(65) evaluates to 'A'.
Formula in cell C4:
Returns "A", the Unicode character for code point 65
Formula in cell C5:
Returns "Z", the Unicode character for code point 90
Formula in cell C6:
Returns "a", the lowercase "a", the Unicode character for code point 97.
Formula in cell C7:
Returns "z", the lowercase "z", the Unicode character for code point 122
Formula in cell C8:
Returns "ÿ", the Unicode character for code point 255
Formula in cell C9:
Returns "ř", a Czech letter, the Unicode character for code point 345
Formula in cell C10:
Returns "ʎ", a phonetic symbol, the Unicode character for code point 654
Formula in cell C11:
Returns "β", the Greek letter beta, the Unicode character for code point 976
Formula in cell C12:
Returns "Ұ", a Cyrillic letter, the Unicode character for code point 1200
Formula in cell C13:
Returns "ڬ", an Arabic letter, the Unicode character for code point 1500
Formula in cell C14:
Returns "✏", a pencil icon, the Unicode character for code point 9999
Formula in cell C15:
Returns "耀", a Chinese character, the Unicode character for code point 32768
4. Function not working
The UNICHAR function returns:
- #N/A error value if data type is not valid.
- #VALUE! error
- if number is 0 (zero).
- if outside the range.
- #NAME? error if you misspell the function name.
- propagates errors, meaning that if the input contains an error (e.g., #VALUE!, #REF!), the function will return the same error.
7.1 Troubleshooting the error value
When you encounter an error value in a cell a warning symbol appears, displayed in the image above. Press with mouse on it to see a pop-up menu that lets you get more information about the error.
- The first line describes the error if you press with left mouse button on it.
- The second line opens a pane that explains the error in greater detail.
- The third line takes you to the "Evaluate Formula" tool, a dialog box appears allowing you to examine the formula in greater detail.
- This line lets you ignore the error value meaning the warning icon disappears, however, the error is still in the cell.
- The fifth line lets you edit the formula in the Formula bar.
- The sixth line opens the Excel settings so you can adjust the Error Checking Options.
Here are a few of the most common Excel errors you may encounter.
#NULL error - This error occurs most often if you by mistake use a space character in a formula where it shouldn't be. Excel interprets a space character as an intersection operator. If the ranges don't intersect an #NULL error is returned. The #NULL! error occurs when a formula attempts to calculate the intersection of two ranges that do not actually intersect. This can happen when the wrong range operator is used in the formula, or when the intersection operator (represented by a space character) is used between two ranges that do not overlap. To fix this error double check that the ranges referenced in the formula that use the intersection operator actually have cells in common.
#SPILL error - The #SPILL! error occurs only in version Excel 365 and is caused by a dynamic array being to large, meaning there are cells below and/or to the right that are not empty. This prevents the dynamic array formula expanding into new empty cells.
#DIV/0 error - This error happens if you try to divide a number by 0 (zero) or a value that equates to zero which is not possible mathematically.
#VALUE error - The #VALUE error occurs when a formula has a value that is of the wrong data type. Such as text where a number is expected or when dates are evaluated as text.
#REF error - The #REF error happens when a cell reference is invalid. This can happen if a cell is deleted that is referenced by a formula.
#NAME error - The #NAME error happens if you misspelled a function or a named range.
#NUM error - The #NUM error shows up when you try to use invalid numeric values in formulas, like square root of a negative number.
#N/A error - The #N/A error happens when a value is not available for a formula or found in a given cell range, for example in the VLOOKUP or MATCH functions.
#GETTING_DATA error - The #GETTING_DATA error shows while external sources are loading, this can indicate a delay in fetching the data or that the external source is unavailable right now.
7.2 The formula returns an unexpected value
To understand why a formula returns an unexpected value we need to examine the calculations steps in detail. Luckily, Excel has a tool that is really handy in these situations. Here is how to troubleshoot a formula:
- Select the cell containing the formula you want to examine in detail.
- Go to tab “Formulas” on the ribbon.
- Press with left mouse button on "Evaluate Formula" button. A dialog box appears.
The formula appears in a white field inside the dialog box. Underlined expressions are calculations being processed in the next step. The italicized expression is the most recent result. The buttons at the bottom of the dialog box allows you to evaluate the formula in smaller calculations which you control. - Press with left mouse button on the "Evaluate" button located at the bottom of the dialog box to process the underlined expression.
- Repeat pressing the "Evaluate" button until you have seen all calculations step by step. This allows you to examine the formula in greater detail and hopefully find the culprit.
- Press "Close" button to dismiss the dialog box.
There is also another way to debug formulas using the function key F9. F9 is especially useful if you have a feeling that a specific part of the formula is the issue, this makes it faster than the "Evaluate Formula" tool since you don't need to go through all calculations to find the issue..
- Enter Edit mode: Double-press with left mouse button on the cell or press F2 to enter Edit mode for the formula.
- Select part of the formula: Highlight the specific part of the formula you want to evaluate. You can select and evaluate any part of the formula that could work as a standalone formula.
- Press F9: This will calculate and display the result of just that selected portion.
- Evaluate step-by-step: You can select and evaluate different parts of the formula to see intermediate results.
- Check for errors: This allows you to pinpoint which part of a complex formula may be causing an error.
The image above shows cell reference B3 converted to hard-coded value using the F9 key. The UNICHAR function requires valid values which is not the case in this example. We have found what is wrong with the formula.
Tips!
- View actual values: Selecting a cell reference and pressing F9 will show the actual values in those cells.
- Exit safely: Press Esc to exit Edit mode without changing the formula. Don't press Enter, as that would replace the formula part with the calculated value.
- Full recalculation: Pressing F9 outside of Edit mode will recalculate all formulas in the workbook.
Remember to be careful not to accidentally overwrite parts of your formula when using F9. Always exit with Esc rather than Enter to preserve the original formula. However, if you make a mistake overwriting the formula it is not the end of the world. You can “undo” the action by pressing keyboard shortcut keys CTRL + z or pressing the “Undo” button
7.3 Other errors
Floating-point arithmetic may give inaccurate results in Excel - Article
Floating-point errors are usually very small, often beyond the 15th decimal place, and in most cases don't affect calculations significantly.
5. How to convert from formal Unicode notation U+XXXX to its corresponding Unicode character
The following formula converts a hexadecimal unicode to the matching unicode character. Converting hexadecimal Unicode to matching Unicode characters can be useful in several scenarios like:
- Web development and HTML: When you need to include special characters or symbols in HTML that aren't easily typed on a keyboard. For ensuring proper character encoding in web pages, especially for multilingual content.
- Programming and software development: When working with internationalization (i18n) and localization (l10n) in software. For handling text processing in languages with non-ASCII characters.
- Data processing and analysis: When dealing with datasets that contain encoded characters from various languages or symbol sets. For cleaning and standardizing text data that may contain Unicode escape sequences.
- Font and typography work: When designing or working with fonts that include extended character sets. For verifying the presence of specific glyphs in a font.
- Cryptography and encoding: In some encryption or encoding schemes that use Unicode characters.
- Database management: When working with databases that store data in Unicode format but display hexadecimal representations.
- Text editors and IDEs: For inputting special characters in code or text that aren't directly available on the keyboard.
In Excel specifically, functions like UNICHAR can be helpful when you need to display or work with Unicode characters based on their code points, which might be stored or processed as numbers in your data.
Formula in cell C3:
Here is a brief explanation, there is a more detailed explanation below.
- MID(B3,3,99): This function extracts a substring from the value in cell B3. It starts at the 3rd character (skipping the first two characters). It attempts to extract up to 99 characters (or until the end of the string). This is likely used to remove a prefix (like "0x" or "U+") from a hexadecimal Unicode representation.
- HEX2DEC(...): This function converts the extracted hexadecimal string to a decimal number. It takes the result of the MID function as its input. For example, if B3 contained "U+1F600", the MID function would extract "1F600", and HEX2DEC would convert this to 128512.
- UNICHAR(...) Finally, this function takes the decimal number produced by HEX2DEC and returns the corresponding Unicode character. It converts the Unicode code point (in decimal) to its actual character representation.
This formula is useful when you have Unicode characters represented in hexadecimal format (like "U+1F600" for a smiley emoji) and you want to display the actual character in Excel. It effectively decodes the hexadecimal Unicode representation into the visible character.
Explaining formula
Step 1 - Remove two first characters from string (U+)
The MID function returns a substring from a string based on the starting position and the number of characters you want to extract.
Function syntax: MID(text, start_num, num_chars)
MID(B3,3,99)
becomes
MID("U+0045",3,99)
and returns "0045".
Step 2 - Convert hexadecimal to decimal
The HEX2DEC function converts a hexadecimal number to a decimal number.
Function syntax: HEX2DEC(number)
HEX2DEC(MID(B3,3,99))
becomes
HEX2DEC("0045")
and returns 69.
Step 3 - Convert unicode to character
The UNICHAR function calculates a character based on a Unicode number.
Function syntax:
UNICHAR(HEX2DEC(MID(B3,3,99)))
becomes
UNICHAR(69)
and returns "E".
Useful links
UNICHAR function - Microsoft support
UNICODE - wikipedia
Functions in 'Text' category
The UNICHAR function function is one of 29 functions in the 'Text' category.
How to comment
How to add a formula to your comment
<code>Insert your formula here.</code>
Convert less than and larger than signs
Use html character entities instead of less than and larger than signs.
< becomes < and > becomes >
How to add VBA code to your comment
[vb 1="vbnet" language=","]
Put your VBA code here.
[/vb]
How to add a picture to your comment:
Upload picture to postimage.org or imgur
Paste image link to your comment.
Contact Oscar
You can contact me through this contact form