2010-03-01 53 views

Answers

12

The resource I have used before is this XML file from the W3C. It maps Unicode to HTML, MathML, LaTeX, Mathematica, and other languages. (The file is 1.4 MB, uncompressed.)

You can read more about it here: http://www.w3.org/TR/unicode-xml/
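
If you want to pull just the LaTeX column out of that file, something like the following might work. It is only a sketch: I am assuming each entry is a `character` element with a decimal `dec` attribute and an optional `latex` child, so check the element names against the actual file. The class and method names are made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

class UnicodeXmlLatexTable {

    /** Builds a code point -> LaTeX map from <character dec="..."><latex>...</latex></character> entries. */
    static Map<Integer, String> load(InputStream xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().parse(xml);
            NodeList characters = doc.getElementsByTagName("character");
            Map<Integer, String> table = new LinkedHashMap<>();
            for (int i = 0; i < characters.getLength(); i++) {
                Element character = (Element) characters.item(i);
                String dec = character.getAttribute("dec");
                NodeList latex = character.getElementsByTagName("latex");
                // Skip entries with no LaTeX mapping, and entries whose "dec"
                // is not a single number (assumed to be multi-code-point sequences).
                if (latex.getLength() == 0 || !dec.matches("\\d+")) {
                    continue;
                }
                table.put(Integer.parseInt(dec), latex.item(0).getTextContent());
            }
            return table;
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse the character table", e);
        }
    }

    public static void main(String[] args) {
        // Tiny hand-written sample in the assumed layout, not an excerpt from the real file.
        String sample = "<unicode>"
                + "<character id=\"U000E9\" dec=\"233\"><latex>\\'{e}</latex></character>"
                + "<character id=\"U000A7\" dec=\"167\"><latex>\\S</latex></character>"
                + "</unicode>";
        Map<Integer, String> table =
                load(new ByteArrayInputStream(sample.getBytes(StandardCharsets.UTF_8)));
        System.out.println(table);
    }
}
```

For the real file you would pass a `FileInputStream` instead of the inline sample.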

+0

Genius! I have been searching for this for ages. So helpful! – by0

+0

In case anyone needs this often (like I do), I made a small web app that makes searching easier than ploughing through that XML file: https://digitalheir.github.io/mathy-unicode-characters/ – Maarten

4

I once cooked this up for a report generator written in Java (hence the Java String literals):

'\\'(REVERSE SOLIDUS) "\\textbackslash{}" 
'^'(CIRCUMFLEX ACCENT) "$\\uparrow$" 
'_'(LOW LINE) "\\textunderscore{}" 
'|'(VERTICAL LINE) "\\vline{}" 
'~'(TILDE) "\\textasciitilde{}" "~" 
'§'(SECTION SIGN) "\\S{}" 
'ª'(FEMININE ORDINAL INDICATOR) "$^a$" 
'­'(SOFT HYPHEN) "\\-" 
'²'(SUPERSCRIPT TWO) "$^2$" 
'³'(SUPERSCRIPT THREE) "$^3$" 
'·'(MIDDLE DOT) "$\\cdot$" 
'¹'(SUPERSCRIPT ONE) "$^1$" 
'º'(MASCULINE ORDINAL INDICATOR) "$^o$" 
'\u013a'(LATIN SMALL LETTER L WITH ACUTE) "\\'l" 
'\u013b'(LATIN CAPITAL LETTER L WITH CEDILLA) "\\c{L}" 
'\u013c'(LATIN SMALL LETTER L WITH CEDILLA) "\\c{l}" 
'\u013d'(LATIN CAPITAL LETTER L WITH CARON) "\\v{L}" 
'\u013e'(LATIN SMALL LETTER L WITH CARON) "\\v{l}" 
'\u013f'(LATIN CAPITAL LETTER L WITH MIDDLE DOT) "L\\hspace{-0.35em}$\\cdot$" 
'\u0140'(LATIN SMALL LETTER L WITH MIDDLE DOT) "l$\\cdot$" 
'\u0141'(LATIN CAPITAL LETTER L WITH STROKE) "\\L{}" 
'\u0142'(LATIN SMALL LETTER L WITH STROKE) "\\l{}" 
'\u0143'(LATIN CAPITAL LETTER N WITH ACUTE) "\\'N" 
'\u0144'(LATIN SMALL LETTER N WITH ACUTE) "\\'n" 
'\u0145'(LATIN CAPITAL LETTER N WITH CEDILLA) "\\c{N}" 
'\u0146'(LATIN SMALL LETTER N WITH CEDILLA) "\\c{n}" 
'\u0147'(LATIN CAPITAL LETTER N WITH CARON) "\\v{N}" 
'\u0148'(LATIN SMALL LETTER N WITH CARON) "\\v{n}" 
'\u0149'(LATIN SMALL LETTER N PRECEDED BY APOSTROPHE) "'n" 
'\u014c'(LATIN CAPITAL LETTER O WITH MACRON) "\\={O}" 
'\u014d'(LATIN SMALL LETTER O WITH MACRON) "\\={o}" 
'\u014e'(LATIN CAPITAL LETTER O WITH BREVE) "\\u{O}" 
'\u014f'(LATIN SMALL LETTER O WITH BREVE) "\\u{o}" 
'\u0150'(LATIN CAPITAL LETTER O WITH DOUBLE ACUTE) "\\H{O}" 
'\u0151'(LATIN SMALL LETTER O WITH DOUBLE ACUTE) "\\H{o}" 
'\u0152'(LATIN CAPITAL LIGATURE OE) "\\OE{}" 
'\u0153'(LATIN SMALL LIGATURE OE) "\\oe{}" 
'\u0154'(LATIN CAPITAL LETTER R WITH ACUTE) "\\'{R}" 
'\u0155'(LATIN SMALL LETTER R WITH ACUTE) "\\'{r}" 
'\u0156'(LATIN CAPITAL LETTER R WITH CEDILLA) "\\c{R}" 
'\u0157'(LATIN SMALL LETTER R WITH CEDILLA) "\\c{r}" 
'\u0158'(LATIN CAPITAL LETTER R WITH CARON) "\\v{R}" 
'\u0159'(LATIN SMALL LETTER R WITH CARON) "\\v{r}" 
'\u015a'(LATIN CAPITAL LETTER S WITH ACUTE) "\\'S" 
'\u015b'(LATIN SMALL LETTER S WITH ACUTE) "\\'s" 
'\u015c'(LATIN CAPITAL LETTER S WITH CIRCUMFLEX) "\\^{S}" 
'\u015d'(LATIN SMALL LETTER S WITH CIRCUMFLEX) "\\^{s}" 
'\u015e'(LATIN CAPITAL LETTER S WITH CEDILLA) "\\c{S}" 
'\u015f'(LATIN SMALL LETTER S WITH CEDILLA) "\\c{s}" 
'\u0160'(LATIN CAPITAL LETTER S WITH CARON) "\\v{S}" 
'\u0161'(LATIN SMALL LETTER S WITH CARON) "\\v{s}" 
'\u0162'(LATIN CAPITAL LETTER T WITH CEDILLA) "\\c{T}" 
'\u0163'(LATIN SMALL LETTER T WITH CEDILLA) "\\c{t}" 
'\u0164'(LATIN CAPITAL LETTER T WITH CARON) "\\v{T}" 
'\u0165'(LATIN SMALL LETTER T WITH CARON) "\\v{t}" 
'\u0168'(LATIN CAPITAL LETTER U WITH TILDE) "\\~{U}" 
'\u0169'(LATIN SMALL LETTER U WITH TILDE) "\\~{u}" 
'\u016a'(LATIN CAPITAL LETTER U WITH MACRON) "\\={U}" 
'\u016b'(LATIN SMALL LETTER U WITH MACRON) "\\={u}" 
'\u016c'(LATIN CAPITAL LETTER U WITH BREVE) "\\u{U}" 
'\u016d'(LATIN SMALL LETTER U WITH BREVE) "\\u{u}" 
'\u016e'(LATIN CAPITAL LETTER U WITH RING ABOVE) "\\r{U}" 
'\u016f'(LATIN SMALL LETTER U WITH RING ABOVE) "\\r{u}" 
'\u0170'(LATIN CAPITAL LETTER U WITH DOUBLE ACUTE) "\\H{U}" 
'\u0171'(LATIN SMALL LETTER U WITH DOUBLE ACUTE) "\\H{u}" 
'\u0172'(LATIN CAPITAL LETTER U WITH OGONEK) "\\k{U}" 
'\u0173'(LATIN SMALL LETTER U WITH OGONEK) "\\k{u}" 
'\u0174'(LATIN CAPITAL LETTER W WITH CIRCUMFLEX) "\\^{W}" 
'\u0175'(LATIN SMALL LETTER W WITH CIRCUMFLEX) "\\^{w}" 
'\u0176'(LATIN CAPITAL LETTER Y WITH CIRCUMFLEX) "\\^{Y}" 
'\u0177'(LATIN SMALL LETTER Y WITH CIRCUMFLEX) "\\^{y}" 
'\u0178'(LATIN CAPITAL LETTER Y WITH DIAERESIS) "\\\"Y" 
'\u0179'(LATIN CAPITAL LETTER Z WITH ACUTE) "\\'Z" 
'\u017a'(LATIN SMALL LETTER Z WITH ACUTE) "\\'z" 
'\u017b'(LATIN CAPITAL LETTER Z WITH DOT ABOVE) "\\.{Z}" 
'\u017c'(LATIN SMALL LETTER Z WITH DOT ABOVE) "\\.{z}" 
'\u017d'(LATIN CAPITAL LETTER Z WITH CARON) "\\v{Z}" 
'\u017e'(LATIN SMALL LETTER Z WITH CARON) "\\v{z}" 
'\u01CD'(LATIN CAPITAL LETTER A WITH CARON) "\\v A" 
'\u01CE'(LATIN SMALL LETTER A WITH CARON) "\\v a" 
'\u01CF'(LATIN CAPITAL LETTER I WITH CARON) "\\v I" 
'\u01D0'(LATIN SMALL LETTER I WITH CARON) "\\v \\i{}" 
'\u01D1'(LATIN CAPITAL LETTER O WITH CARON) "\\v O" 
'\u01D2'(LATIN SMALL LETTER O WITH CARON) "\\v o" 
'\u01D3'(LATIN CAPITAL LETTER U WITH CARON) "\\v U" 
'\u01D4'(LATIN SMALL LETTER U WITH CARON) "\\v u" 
'\u01D5'(LATIN CAPITAL LETTER U WITH DIAERESIS AND MACRON) "\\=Ü" 
'\u01D6'(LATIN SMALL LETTER U WITH DIAERESIS AND MACRON) "\\=ü" 
'\u01D7'(LATIN CAPITAL LETTER U WITH DIAERESIS AND ACUTE) "\\'Ü" 
'\u01D8'(LATIN SMALL LETTER U WITH DIAERESIS AND ACUTE) "\\'ü" 
'\u01D9'(LATIN CAPITAL LETTER U WITH DIAERESIS AND CARON) "\\v Ü" 
'\u01DA'(LATIN SMALL LETTER U WITH DIAERESIS AND CARON) "\\v ü" 
'\u01DB'(LATIN CAPITAL LETTER U WITH DIAERESIS AND GRAVE) "\\` Ü" 
'\u01DC'(LATIN SMALL LETTER U WITH DIAERESIS AND GRAVE) "\\` ü" 
'\u01DE'(LATIN CAPITAL LETTER A WITH DIAERESIS AND MACRON) "\\= Ä" 
'\u01DF'(LATIN SMALL LETTER A WITH DIAERESIS AND MACRON) "\\= ä" 
'\u01E6'(LATIN CAPITAL LETTER G WITH CARON) "\\v G" 
'\u01E7'(LATIN SMALL LETTER G WITH CARON) "\\v g" 
'\u01E8'(LATIN CAPITAL LETTER K WITH CARON) "\\v K" 
'\u01E9'(LATIN SMALL LETTER K WITH CARON) "\\v k" 
'\u01EA'(LATIN CAPITAL LETTER O WITH OGONEK) "\\k O" 
'\u01EB'(LATIN SMALL LETTER O WITH OGONEK) "\\k o" 
'\u01F1'(LATIN CAPITAL LETTER DZ) "DZ" 
'\u01F2'(LATIN CAPITAL LETTER D WITH SMALL LETTER Z) "Dz" 
'\u01F3'(LATIN SMALL LETTER DZ) "dz" 
'\u01F4'(LATIN CAPITAL LETTER G WITH ACUTE) "\\'G" 
'\u01F5'(LATIN SMALL LETTER G WITH ACUTE) "\\'g" 
'\u01F8'(LATIN CAPITAL LETTER N WITH GRAVE) "\\`N" 
'\u01F9'(LATIN SMALL LETTER N WITH GRAVE) "\\`n" 
'\u01FA'(LATIN CAPITAL LETTER A WITH RING ABOVE AND ACUTE) "\\'Å" 
'\u01FB'(LATIN SMALL LETTER A WITH RING ABOVE AND ACUTE) "\\'å" 
'\u01FC'(LATIN CAPITAL LETTER AE WITH ACUTE) "\\'Æ" 
'\u01FD'(LATIN SMALL LETTER AE WITH ACUTE) "\\'æ" 
'\u01FE'(LATIN CAPITAL LETTER O WITH STROKE AND ACUTE) "\\'Ø" 
'\u01FF'(LATIN SMALL LETTER O WITH STROKE AND ACUTE) "\\'ø" 
'\u0200'(LATIN CAPITAL LETTER A WITH DOUBLE GRAVE) "\\textdoublegrave{A}" 
'\u0201'(LATIN SMALL LETTER A WITH DOUBLE GRAVE) "\\textdoublegrave{a}" 
'\u0202'(LATIN CAPITAL LETTER A WITH INVERTED BREVE) "\\textroundcap{A}" 
'\u0203'(LATIN SMALL LETTER A WITH INVERTED BREVE) "\\textroundcap{a}" 
'\u0204'(LATIN CAPITAL LETTER E WITH DOUBLE GRAVE) "\\textdoublegrave{E}" 
'\u0205'(LATIN SMALL LETTER E WITH DOUBLE GRAVE) "\\textdoublegrave{e}" 
'\u0206'(LATIN CAPITAL LETTER E WITH INVERTED BREVE) "\\textroundcap{E}" 
'\u0207'(LATIN SMALL LETTER E WITH INVERTED BREVE) "\\textroundcap{e}" 
'\u0208'(LATIN CAPITAL LETTER I WITH DOUBLE GRAVE) "\\textdoublegrave{I}" 
'\u0209'(LATIN SMALL LETTER I WITH DOUBLE GRAVE) "\\textdoublegrave{\\i}" 
'\u020A'(LATIN CAPITAL LETTER I WITH INVERTED BREVE) "\\textroundcap{I}" 
'\u020B'(LATIN SMALL LETTER I WITH INVERTED BREVE) "\\textroundcap{\\i}" 
'\u020C'(LATIN CAPITAL LETTER O WITH DOUBLE GRAVE) "\\textdoublegrave{O}" 
'\u020D'(LATIN SMALL LETTER O WITH DOUBLE GRAVE) "\\textdoublegrave{o}" 
'\u020E'(LATIN CAPITAL LETTER O WITH INVERTED BREVE) "\\textroundcap{O}" 
'\u020F'(LATIN SMALL LETTER O WITH INVERTED BREVE) "\\textroundcap{o}" 
'\u0210'(LATIN CAPITAL LETTER R WITH DOUBLE GRAVE) "\\textdoublegrave{R}" 
'\u0211'(LATIN SMALL LETTER R WITH DOUBLE GRAVE) "\\textdoublegrave{r}" 
'\u0212'(LATIN CAPITAL LETTER R WITH INVERTED BREVE) "\\textroundcap{R}" 
'\u0213'(LATIN SMALL LETTER R WITH INVERTED BREVE) "\\textroundcap{r}" 
'\u0214'(LATIN CAPITAL LETTER U WITH DOUBLE GRAVE) "\\textdoublegrave{U}" 
'\u0215'(LATIN SMALL LETTER U WITH DOUBLE GRAVE) "\\textdoublegrave{u}" 
'\u0216'(LATIN CAPITAL LETTER U WITH INVERTED BREVE) "\\textroundcap{U}" 
'\u0217'(LATIN SMALL LETTER U WITH INVERTED BREVE) "\\textroundcap{u}" 
'\u0218'(LATIN CAPITAL LETTER S WITH COMMA BELOW) "\\textcommabelow{S}" 
'\u0219'(LATIN SMALL LETTER S WITH COMMA BELOW) "\\textcommabelow{s}" 
'\u021A'(LATIN CAPITAL LETTER T WITH COMMA BELOW) "\\textcommabelow{T}" 
'\u021B'(LATIN SMALL LETTER T WITH COMMA BELOW) "\\textcommabelow{t}" 
'\u021E'(LATIN CAPITAL LETTER H WITH CARON) "\\v{H}" 
'\u021F'(LATIN SMALL LETTER H WITH CARON) "\\v{h}" 
'\u0226'(LATIN CAPITAL LETTER A WITH DOT ABOVE) "\\.A" 
'\u0227'(LATIN SMALL LETTER A WITH DOT ABOVE) "\\.a" 
'\u0228'(LATIN CAPITAL LETTER E WITH CEDILLA) "\\c E" 
'\u0229'(LATIN SMALL LETTER E WITH CEDILLA) "\\c e" 
'\u022A'(LATIN CAPITAL LETTER O WITH DIAERESIS AND MACRON) "\\= Ö" 
'\u022B'(LATIN SMALL LETTER O WITH DIAERESIS AND MACRON) "\\= ö" 
'\u022C'(LATIN CAPITAL LETTER O WITH TILDE AND MACRON) "\\makeatletter\\@tabacckludge={\\~O}\\makeatother{}" 
'\u022D'(LATIN SMALL LETTER O WITH TILDE AND MACRON) "\\makeatletter\\@tabacckludge={\\~o}\\makeatother{}" 
'\u022E'(LATIN CAPITAL LETTER O WITH DOT ABOVE) "\\.O" 
'\u022F'(LATIN SMALL LETTER O WITH DOT ABOVE) "\\.o" 
'\u0232'(LATIN CAPITAL LETTER Y WITH MACRON) "\\=Y" 
'\u0233'(LATIN SMALL LETTER Y WITH MACRON) "\\=y" 
'\u023A'(LATIN CAPITAL LETTER A WITH STROKE) "/\\hspace{-0.5em}A" 
'\u023B'(LATIN CAPITAL LETTER C WITH STROKE) "/\\hspace{-0.5em}C" 
'\u023C'(LATIN SMALL LETTER C WITH STROKE) "/\\hspace{-0.4em}c" 
'\u023D'(LATIN CAPITAL LETTER L WITH BAR) "-\\hspace{-0.3em}L" 
'\u023E'(LATIN CAPITAL LETTER T WITH DIAGONAL STROKE) "-\\hspace{-0.3em}T" 
'\u20AC'(EURO SIGN) "\\texteuro{}" 
'\u2018'(LEFT SINGLE QUOTATION MARK) "'" 
'\u2019'(RIGHT SINGLE QUOTATION MARK) "'" 
'\u201A'(SINGLE LOW-9 QUOTATION MARK) "'" 
'\u201B'(SINGLE HIGH-REVERSED-9 QUOTATION MARK) "'" 
'\u201C'(LEFT DOUBLE QUOTATION MARK) "\"{}" 
'\u201D'(RIGHT DOUBLE QUOTATION MARK) "\"{}" 
'\u201E'(DOUBLE LOW-9 QUOTATION MARK) "\"{}" 
'\u201F'(DOUBLE HIGH-REVERSED-9 QUOTATION MARK) "\"{}" 
'\u025B'(LATIN SMALL LETTER OPEN E) "\\textepsilon{}" 
'\u0283'(LATIN SMALL LETTER ESH) "\\textesh{}" 

But I am fairly sure there is no comprehensive mapping anywhere: Unicode is HUGE. You will probably have to compile one yourself and maintain it yourself. Good luck!
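
For what it is worth, applying a table like the one above can be as simple as a per-character lookup. This is a minimal sketch rather than the original report generator's code: the class name `LatexEscaper` is hypothetical, and only a handful of rows from the list are included.

```java
import java.util.HashMap;
import java.util.Map;

class LatexEscaper {

    // A few rows copied from the list above; a real table would hold all of them.
    private static final Map<Character, String> TABLE = new HashMap<>();
    static {
        TABLE.put('\\', "\\textbackslash{}");
        TABLE.put('_', "\\textunderscore{}");
        TABLE.put('\u00a7', "\\S{}");       // SECTION SIGN
        TABLE.put('\u0142', "\\l{}");       // LATIN SMALL LETTER L WITH STROKE
        TABLE.put('\u0161', "\\v{s}");      // LATIN SMALL LETTER S WITH CARON
        TABLE.put('\u20AC', "\\texteuro{}"); // EURO SIGN
    }

    /** Replaces every character that has a table entry; all other characters pass through unchanged. */
    static String escape(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            String replacement = TABLE.get(c);
            if (replacement != null) {
                out.append(replacement);
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(escape("file_name costs 5 \u20AC"));
    }
}
```

Iterating over `char` values is enough here because every entry in the list is in the Basic Multilingual Plane; a table with supplementary characters would need to iterate over code points instead.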

+0

Very impressive list! Could you comment on how to compile a more comprehensive one? – SOUser
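
One possible way to generate many of the accented-letter rows instead of typing them, offered as a sketch and not as how the list above was actually built: decompose each character with `java.text.Normalizer` (NFD) and translate the combining mark into the matching LaTeX accent command. The class name `AccentDecomposer` and the accent table are my own; characters with no decomposition (such as ł) or with more than one accent are simply skipped and still need hand-written entries.

```java
import java.text.Normalizer;
import java.util.HashMap;
import java.util.Map;

class AccentDecomposer {

    // Combining mark -> LaTeX accent command.
    private static final Map<Character, String> ACCENTS = new HashMap<>();
    static {
        ACCENTS.put('\u0300', "\\`");  // grave
        ACCENTS.put('\u0301', "\\'");  // acute
        ACCENTS.put('\u0302', "\\^");  // circumflex
        ACCENTS.put('\u0303', "\\~");  // tilde
        ACCENTS.put('\u0304', "\\=");  // macron
        ACCENTS.put('\u0306', "\\u");  // breve
        ACCENTS.put('\u0307', "\\.");  // dot above
        ACCENTS.put('\u0308', "\\\""); // diaeresis
        ACCENTS.put('\u030a', "\\r");  // ring above
        ACCENTS.put('\u030b', "\\H");  // double acute
        ACCENTS.put('\u030c', "\\v");  // caron
        ACCENTS.put('\u0327', "\\c");  // cedilla
        ACCENTS.put('\u0328', "\\k");  // ogonek
    }

    /** Returns accent{base} for a letter carrying exactly one known accent, or null otherwise. */
    static String toLatex(char c) {
        String nfd = Normalizer.normalize(String.valueOf(c), Normalizer.Form.NFD);
        if (nfd.length() != 2) {
            return null; // no decomposition, or more than one combining mark
        }
        String accent = ACCENTS.get(nfd.charAt(1));
        if (accent == null) {
            return null;
        }
        return accent + "{" + nfd.charAt(0) + "}";
    }

    public static void main(String[] args) {
        // Walk Latin Extended-A and print whatever can be derived automatically.
        for (char c = '\u0100'; c <= '\u017f'; c++) {
            String latex = toLatex(c);
            if (latex != null) {
                System.out.printf("U+%04X %s%n", (int) c, latex);
            }
        }
    }
}
```

The generated entries would still need a manual pass, since not every accent command is available in every LaTeX setup.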

0

You can take a look at my LaTeX to Unicode converter. It has a JavaScript API that you can use under the MIT license. It is partly based on the W3C document shared earlier, but supports many more mappings that I collected from here and there.

Most mappings are simple lookup tables, but some commands have no Unicode equivalent, or no obvious one. A comprehensive converter requires creative decisions. For example, fractions are quite tricky (\frac{5}{8} produces ⅝, \frac{5}{80} produces 5⁄80, and \frac{5}{80a} produces (5/(80a))).
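
To illustrate the kind of decision involved, here is a rough Java sketch of a three-way fallback for fractions. It is not the converter's actual code (that is JavaScript); the class name `FractionToUnicode` and the exact rules are assumptions modelled on the three examples above: use a precomposed character when one exists, join plain integers with FRACTION SLASH (U+2044), and otherwise fall back to a parenthesised ASCII form.

```java
import java.util.HashMap;
import java.util.Map;

class FractionToUnicode {

    // Some of the precomposed vulgar fractions that Unicode provides.
    private static final Map<String, String> PRECOMPOSED = new HashMap<>();
    static {
        PRECOMPOSED.put("1/4", "\u00BC");
        PRECOMPOSED.put("1/2", "\u00BD");
        PRECOMPOSED.put("3/4", "\u00BE");
        PRECOMPOSED.put("1/8", "\u215B");
        PRECOMPOSED.put("3/8", "\u215C");
        PRECOMPOSED.put("5/8", "\u215D");
        PRECOMPOSED.put("7/8", "\u215E");
    }

    /** Converts the two arguments of a fraction command to a Unicode string. */
    static String convert(String numerator, String denominator) {
        String precomposed = PRECOMPOSED.get(numerator + "/" + denominator);
        if (precomposed != null) {
            return precomposed;
        }
        if (numerator.matches("\\d+") && denominator.matches("\\d+")) {
            // Two plain integers: join them with FRACTION SLASH.
            return numerator + "\u2044" + denominator;
        }
        // Anything else: spell it out with parentheses so grouping stays unambiguous.
        return "(" + numerator + "/(" + denominator + "))";
    }

    public static void main(String[] args) {
        System.out.println(convert("5", "8"));
        System.out.println(convert("5", "80"));
        System.out.println(convert("5", "80a"));
    }
}
```

Each branch trades fidelity for coverage, which is why a plain lookup table is not enough for commands like this.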
