Reading National Health Insurance card data from a card reader with Python 3.7.3: the name field comes out garbled (Big5 to UTF-8) (solved)

#ic-card-reading #encoding-conversion python3 utf-8 pyscard module

qvrblz619736 2020-06-18 21:32:51 ‧ 5709 views

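I'm using the pyscard module with a card reader, and I started by testing with a sample program.
IC card reading sample (reference source)
The sample does no special transcoding, yet the name displays correctly (it is masked, but you can still tell).
After that I looked at approaches online and tried converting the data to bytes myself.
Figure 1 link
Converting it back to Unicode produced garbled text again, but different garbled text this time, so it feels like a small step forward...
Figure 2 link
I tried another approach as well and ended up right back where I started.
Figure 3 link
-------- Update: 2020-06-20, 11:30 AM --------
I tried many of the Python 3 encoding-conversion methods found online.
A little progress, maybe? (wry smile)

Required module: pyscard. Environment: Linux.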

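from smartcard.System import readers

# APDU that selects the health insurance card applet
SelectAPDU = [ 0x00, 0xA4, 0x04, 0x00, 0x10, 0xD1, 0x58, 0x00, 0x00, 0x01, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x11, 0x00 ]
# APDU that reads the basic profile record
ReadProfileAPDU = [ 0x00, 0xca, 0x11, 0x00, 0x02, 0x00, 0x00 ]
r = readers()
reader = r[0]
# A connection object has to be created before connect() can be called
connection = reader.createConnection()
connection.connect()
data, sw1, sw2 = connection.transmit(SelectAPDU)
data, sw1, sw2 = connection.transmit(ReadProfileAPDU)
# 姓名 = name; chr() maps each byte to one code point, so the Big5 name comes out garbled
print('姓名 : %s' % ''.join(chr(i) for i in data[12:32]))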

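froce iT邦大師 Level 1 ‧ 2020-06-19 08:14:48

In Python 3, str has no decode(); it only has encode(), which returns a bytes object.
You probably need to give us a sample of the string you are trying to process before we can help.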
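
To illustrate the point about str and bytes, here is a minimal sketch. The sample text is made up for the example and does not come from a card.

```python
text = "健保卡"                     # str: Unicode text in Python 3
raw = text.encode("big5")           # encode() turns str into bytes
print(type(raw).__name__, len(raw))

# str has no decode() in Python 3; calling it raises AttributeError.
print(hasattr(text, "decode"))

# decode() lives on bytes and needs the encoding the bytes were written in.
round_trip = raw.decode("big5")
print(round_trip == text)
```

The direction matters: encode() goes from text to bytes, decode() goes from bytes back to text.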

japhenchen iT邦超人 Level 1 ‧ 2020-06-19 09:39:32

sa = unicode('姓名 : %s' ''.join(<the remaining 3000 characters omitted>))

qvrblz619736 iT邦新手 Level 5 ‧ 2020-06-20 11:30:09

What japhenchen posted looks like Python 2 syntax.
I think I have made a little progress...
I tried many of the methods found online.
Uploading a screenshot first.

froce iT邦大師 Level 1 ‧ 2020-06-20 13:04:30

I think the problem is in how you read the data: what comes in is probably Big5, and it is being treated as UTF-8.
Try adding encoding="big5" when you read it.
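
A small sketch of what "Big5 treated as UTF-8" looks like in practice. pyscard's transmit() returns a list of integers rather than text, so there is no encoding parameter to pass at read time; the equivalent step is choosing the codec when the bytes are decoded. The name below is a made-up placeholder, not card data.

```python
name_bytes = "王小明".encode("big5")   # stand-in for the raw name field

# Decoding Big5 bytes as UTF-8 fails, because the bytes are not valid UTF-8.
try:
    name_bytes.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False
print("valid UTF-8:", utf8_ok)

# Decoding with the codec the bytes were written in recovers the text.
decoded = name_bytes.decode("big5")
print(decoded)
```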

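froce iT邦大師 Level 1 ‧ 2020-06-20 14:08:07

http://boywhy.blogspot.com/2014/11/java-java.html
1. According to this post, the name should be bytes 12 to 31.
2. The variable data should hold byte values... is it really OK to call chr() on each of them directly?

The data you read raises privacy concerns, so I suggest you just write the smallest piece of code that can read a health insurance card, so that I can test it.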

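qvrblz619736 iT邦新手 Level 5 ‧ 2020-06-20 14:55:23

The code has been simplified and posted above for your reference.
I am developing on Linux with the pyscard module installed, using Python 3.7.3.
On Windows (Anaconda) I could not get the pyscard module to install.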

qvrblz619736 iT邦新手 Level 5 ‧ 2020-06-20 14:56:41

You need an IC card reader to test it.

froce iT邦大師 Level 1 ‧ 2020-06-20 15:01:38

pyscard can be installed on Windows; it is Anaconda that does not work.

2 Answers

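froce

iT邦大師 Level 1 ‧ 2020-06-20 14:58:56

Best answer

I don't feel like writing work code today, so I'll help solve this instead.
The reading code is copied from the gist below and ported to Python 3.
https://gist.github.com/chihchun/4316159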

from smartcard.System import readers

# define the APDUs used in this script
SelectAPDU = [ 0x00, 0xA4, 0x04, 0x00, 0x10, 0xD1, 0x58, 0x00, 0x00, 0x01, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x11, 0x00 ]

ReadProfileAPDU = [ 0x00, 0xca, 0x11, 0x00, 0x02, 0x00, 0x00 ]

# get all the available readers
r = readers()
print("Available readers:", r)

reader = r[0]
print("Using:", reader)

connection = reader.createConnection()
connection.connect()

data, sw1, sw2 = connection.transmit(SelectAPDU)
print("Select Applet: %02X %02X" % (sw1, sw2))

data, sw1, sw2 = connection.transmit(ReadProfileAPDU)
print("Command: %02X %02X" % (sw1, sw2))
print('Card Number : %s' % ''.join(chr(i) for i in data[0:12]))

# The name really is in bytes 12-31 of the data returned by the reader. Big5 uses two
# bytes per character, so the name itself is roughly the first 6-8 bytes of that field.
# In Python 3, str is Unicode text and raw data is bytes; to display the name, turn the
# values into bytes and decode them with the correct encoding.
print("name: {}".format(bytes(data[12:32]).decode("big5")))

print('ID Number : %s' % ''.join(chr(i) for i in data[32:42]))
print('Birthday : %s' % ''.join(chr(i) for i in data[43:49]))
print('Sex : %s' % ''.join(chr(i) for i in data[49:50]))
print('Card Date : %s' % ''.join(chr(i) for i in data[51:57]))
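
The field slicing in the script can also be collected into one helper, which makes it testable without a reader. This is only a sketch: `parse_profile` and `is_success` are hypothetical names, the offsets are the ones used in the script, the padding of the name field with NUL or space bytes is an assumption, and the record below is synthetic.

```python
def is_success(sw1, sw2):
    """ISO 7816-4 reports normal completion as status words 90 00."""
    return (sw1, sw2) == (0x90, 0x00)


def parse_profile(data):
    """Split the profile record using the offsets from the script above."""
    raw = bytes(data)  # pyscard's transmit() returns a list of ints

    def ascii_field(start, end):
        return raw[start:end].decode("ascii", errors="replace")

    return {
        "card_number": ascii_field(0, 12),
        # Assumption: unused bytes in the name field are NUL or space padding.
        "name": raw[12:32].rstrip(b"\x00 ").decode("big5"),
        "id_number": ascii_field(32, 42),
        "birthday": ascii_field(43, 49),
        "sex": ascii_field(49, 50),
        "card_date": ascii_field(51, 57),
    }


# Synthetic record (not real card data) laid out to match those offsets.
fake = (
    b"000012345678"
    + "王小明".encode("big5").ljust(20, b"\x00")
    + b"A123456789" + b"0" + b"800101" + b"M" + b"0" + b"090101"
)
profile = parse_profile(list(fake))
print(profile["name"], profile["id_number"])
```

With a real card, the idea would be to call `parse_profile(data)` only after `is_success(sw1, sw2)` returns True for the ReadProfileAPDU response.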

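froce iT邦大師 Level 1 ‧ 2020-06-20 15:09:32

And then I noticed that whoever wrote the original code got this wrong too.
Big5 characters take two bytes each, so if you call chr() on every single byte, of course it decodes incorrectly.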
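
A minimal sketch of why the per-byte chr() approach garbles the name, using a made-up name in place of card data. Calling chr() on each byte is the same as decoding the bytes as Latin-1, so every two-byte Big5 character turns into two unrelated characters.

```python
data = list("王小明".encode("big5"))   # list of ints, like the card response

wrong = "".join(chr(i) for i in data)  # one code point per byte
right = bytes(data).decode("big5")     # one character per two bytes

print(len(wrong), len(right))

# chr() per byte is equivalent to a Latin-1 decode of the same bytes.
same_as_latin1 = wrong == bytes(data).decode("latin-1")

# A string garbled this way can still be repaired, since no bytes were lost.
repaired = wrong.encode("latin-1").decode("big5")
print(repaired)
```

For the ASCII-only fields (card number, ID number, dates) the per-byte approach happens to work, because each of those characters fits in a single byte.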

qvrblz619736 iT邦新手 Level 5 ‧ 2020-06-20 21:11:03

Thank you for the guidance.
The encoding problem is solved.

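atgt35 iT邦新手 Level 5 ‧ 2020-10-08 11:11:33

When you read a rare character, do you replace it, or what would be a better way to handle it?
I want to display it rather than replace it. How should I handle that?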
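
The thread does not answer this question. One possible approach, sketched below under assumptions, is to try Python's wider Big5 variants (`cp950`, `big5hkscs`) before giving up, and to keep the undecodable bytes visible instead of silently replacing them. `decode_name` is a hypothetical helper, and whether the card actually stores rare characters in any of these code pages is not confirmed by the source.

```python
def decode_name(raw, encodings=("big5", "cp950", "big5hkscs")):
    """Try each codec in turn; return (text, codec_used)."""
    for enc in encodings:
        try:
            return raw.decode(enc), enc
        except UnicodeDecodeError:
            continue
    # Last resort: keep the bad bytes visible as \xNN escapes so they can be
    # looked up later, instead of dropping them or showing a placeholder.
    return raw.decode("big5", errors="backslashreplace"), "big5+backslashreplace"


ok_text, ok_codec = decode_name("王小明".encode("big5"))
print(ok_text, ok_codec)

# 0xFF is not a valid Big5 byte, so this input falls through to the fallback.
bad_text, bad_codec = decode_name("王".encode("big5") + b"\xff")
print(bad_text, bad_codec)
```

Characters outside all three code pages would still need a separate mapping table, which this sketch does not attempt.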

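一級屠豬士

iT邦大師 Level 1 ‧ 2020-06-19 07:30:15

Please stop posting only screenshots.
You should search for "str object has no attribute decode".
Python 3 is very different from Python 2.
Python's development started very early, slightly before Unicode, so all the way through Python 2, which stayed in use for a long time,
people kept having to call encode() and decode().
Python 3 supports Unicode natively, so that hassle is no longer needed. Two people above have already pointed this out to you.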