
How to check whether a txt file is GBK or UTF-8 encoded?

Asked by 宁 祝 on 14 Jun 2022
Answered by Shubham Dhanda on 28 Jun 2023
How can I check whether a txt file is encoded in GBK or UTF-8?

Answers (1)

Shubham Dhanda on 28 Jun 2023
Hi, 
I understand that you want to find whether the encoding of the specified text file is GBK or UTF-8.
Below is the MATLAB code to check the encoding of a txt file:
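% Specify the file path and name
filename = 'untitled.txt';
% Read the file as raw bytes (row vector of uint8)
fid = fopen(filename, 'r');
if fid == -1
    error('Could not open file: %s', filename);
end
data = reshape(fread(fid, Inf, '*uint8'), 1, []);
fclose(fid);
% A leading byte order mark (EF BB BF) marks the file as UTF-8,
% but many UTF-8 files have no BOM, so its absence proves nothing
hasBOM = numel(data) >= 3 && isequal(data(1:3), uint8([239 187 191]));
% native2unicode generally substitutes invalid bytes instead of throwing,
% so try/catch cannot detect a wrong encoding. Instead, decode and re-encode:
% the bytes are treated as valid only if they survive the round trip
roundTrips = @(enc) isequal(reshape(unicode2native(native2unicode(data, enc), enc), 1, []), data);
isUTF8 = hasBOM || roundTrips('UTF-8');
isGBK = roundTrips('GBK');
% Report the result. UTF-8 is tested before GBK because valid UTF-8 rarely
% occurs by accident, whereas many byte sequences are also valid GBK
if all(data < 128)
    disp('The file is plain ASCII, which is valid in both UTF-8 and GBK.');
elseif isUTF8
    disp('The file is in UTF-8 format.');
elseif isGBK
    disp('The file is in GBK format.');
else
    disp('The file encoding is not UTF-8 or GBK.');
    % You can assume it is encoded in another format
end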
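If you would rather not depend on how the decoder treats invalid bytes, you can also test the byte patterns directly, since UTF-8 has a strict structure that GBK text containing Chinese characters usually violates. The following is only a minimal sketch of that idea: isValidUTF8 is a hypothetical helper name (not a built-in MATLAB function), and it assumes the input is a numeric vector of byte values in the range 0 to 255, such as the output of fread.

```matlab
function tf = isValidUTF8(bytes)
% isValidUTF8  True if BYTES is a well-formed UTF-8 byte sequence.
% Hypothetical helper for illustration, not part of MATLAB.
b = double(bytes(:)');          % row vector of doubles for easy comparison
n = numel(b);
i = 1;
tf = false;
while i <= n
    c = b(i);
    lo = 128; hi = 191;         % allowed range for the first trailing byte
    if c <= 127                 % 0xxxxxxx: single byte (ASCII)
        len = 1;
    elseif c >= 194 && c <= 223 % 110xxxxx: 2-byte sequence (C0 and C1 are overlong)
        len = 2;
    elseif c >= 224 && c <= 239 % 1110xxxx: 3-byte sequence
        len = 3;
        if c == 224, lo = 160; end   % reject overlong forms
        if c == 237, hi = 159; end   % reject UTF-16 surrogates
    elseif c >= 240 && c <= 244 % 11110xxx: 4-byte sequence
        len = 4;
        if c == 240, lo = 144; end   % reject overlong forms
        if c == 244, hi = 143; end   % reject code points above U+10FFFF
    else
        return;                 % stray continuation byte or invalid lead byte
    end
    if i + len - 1 > n
        return;                 % sequence is truncated at the end of the input
    end
    if len > 1
        t = b(i+1:i+len-1);     % trailing bytes of this sequence
        if t(1) < lo || t(1) > hi || any(t(2:end) < 128 | t(2:end) > 191)
            return;
        end
    end
    i = i + len;
end
tf = true;
end
```

Save this as isValidUTF8.m (or place it at the end of a script) and call it on the bytes read from the file, for example isValidUTF8(fread(fid, Inf, '*uint8')). Note that pure ASCII input passes this check, because ASCII is valid in both encodings, and a short GBK file could in principle pass by coincidence, so a positive result is strong evidence of UTF-8 rather than proof.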
Hope this helps.
