2024 iThome 鐵人賽

DAY 8

Python

Python和R入門語法比較系列第 8 篇

02-1 encoding 編碼是什麼

16th鐵人賽

carplee

團隊為你抓鯉魚

2024-09-21 00:17:54

150 瀏覽

分享至

不同的編碼方式，在解碼的時候，可能導致亂碼，如下圖所示：

Python

請先執行：

def decode1():
    x = input()
    a = ''
    for i in x:
        a+=i
        if len(a)<6:
            c = chr(int(a))
            print(c)
        else:
            a=a[-1]

def decode2():
    x = input()
    a = ''
    for i in x:
        a+=i
        if len(a)==5:
            c=chr(int(a))
            print(c, end='')
        if len(a)>=6:
            a=a[-1]

decode1 和 decode2 是我寫的二個功能(function)，中文又稱作函式，最後一行的 # 解碼，示範解碼 "20320229092196665311"這串數字。

編碼

以ASCII,big5,utf8,utf16四個編碼方式為例：

ASCII

中文轉ASCII

ord( ) <-> chr( )

ord('你')

chr(20320)

    '你'

big5(cp950)

認識中文字元碼

Unicode utf-8

a = '好'
a.encode('utf8')

    b'\xe5\xa5\xbd'

Unicode utf-16

print('\u4f60', '\u597d')

    你 好

# ascii將中文字轉成utf16
ascii('你好嗎？')

    "'\\u4f60\\u597d\\u55ce\\uff1f'"

解碼(decoding)

# 記得先執行前面的 def

20320229092196665311

#decode1()
decode2()

    20320229092196665311
    你好嗎？

讀檔

讀檔的時候，有時會加encoding='utf-8'這一行，來解決亂碼的問題。

with open('檔名.副檔名', encoding='utf-8') as f:
    f.read...

內容預告：

02-2 Python的read... #讀檔

03 more about csv in Python and R

04 Python: pandas Series 數值資料 v R: 數值向量

05 Python: Pandas Series 字串資料 v. R:文字向量

06 日期 in Python and R

01 Python的write( ) #寫檔和R語法

02-2 Python的read...和 R語法 #讀檔

系列文

Python和R入門語法比較共 30 篇

RSS系列文訂閱系列文

1 人訂閱

完整目錄

直播研討會

{{ item.channelVendor }} {{ item.webinarstarted }} |

直播中

尚未有邦友留言

立即登入留言

參賽組數

1064 組

團體組數

40 組

累計文章數

22195 篇

完賽人數

600 人

15th鐵人賽 16th鐵人賽 13th鐵人賽 14th鐵人賽 12th鐵人賽 11th鐵人賽鐵人賽 2019鐵人賽 javascript 2018鐵人賽 python 2017鐵人賽 windows php c# windows server linux css react vue.js

IT邦幫忙

Python和R入門語法比較系列 第 8 篇

02-1 encoding 編碼 是什麼

Python

編碼

ASCII

big5(cp950)

Unicode utf-8

Unicode utf-16

解碼(decoding)

讀檔

內容預告：

02-2 Python的read... #讀檔

03 more about csv in Python and R

04 Python: pandas Series 數值資料 v R: 數值向量

05 Python: Pandas Series 字串資料 v. R:文字向量

06 日期 in Python and R

尚未有邦友留言

標記使用者

Python和R入門語法比較系列第 8 篇

02-1 encoding 編碼是什麼