Diacritic remover

Diacritic/Accent remover

How to use this diacritic remover?

Diacritic (accent) remover can help you with removing special (accent) characters from text. Simply copy & paste your diacritic dirty text to left side textarea and see result on the right side. For quicker use click on copy bellow right textarea to copy formatted text.

How accent remover works?

This tool will convert any accent characters to their non accent variant. The result of this conversion is in 26 letters latin alphabet.

Orignal:

En général, c'est à ce moment-là 
que je me rends compte que je suis en retard !

After removing diacritic:

En general, c'est a ce moment-la 
que je me rends compte que je suis en retard !

Why accent characters exists?

Diacritic is an extra glyph added to letter. This glyphs can be found in many languages such as French, Czech, Spanish and so on. Name of this extra glyphs "diacritic" comes from acient greek word "diakritikós" which can be translated to "distinguishing". The main purpose of diacritic is to add some extra sound value to original letter. There are few groups of accent glyphs which are used in many languages such as accents, dots, curves as so on. Writing without using diacritic can result into invalid or non officialy looking text.

Computer era

The diacritic looks good, it adds some extra special vibe to text, but it's really painful for computers. Back in days when computers don't have much memory they just simply can not include all glyphs required for every language. To limit size of text only ASCII standart was used.

This standard used only simple english alphabet and some special characters such as numbers, question mark and so on. This was great for performance but problem with accent was not solved yet. The solution was to add multiple encoding standarts for each languages and it quickly became a mess.

Have you seen something similiar to this? I bet you do, this is result of not using right encoding. the text is decoded in wrong language set.

            ÉGÉìÉRÅ[ÉfÉBÉìÉOÇÕìÔÇ
          

The real solution of letters and glyphs came in 1991. It's called Unicode, internation standart of how text should be encoded. The Unicode Transformation Format also know as "UTF-8" is encoding schema for all characters possible. Nowdays is widely used, and people don't have much text encoding issues from then. But when you are trying to save some space removing diacritics can be really helpful. And that's why we built this simple tool.