Import specific type of text file

Question

Pepe on 14 Jan 2019


https://kr.mathworks.com/matlabcentral/answers/439688-import-specific-type-of-text-file

Commented: Guillaume on 17 Jan 2019

Accepted Answer: Jan

code=abas&period=30&endtime=2015-06-30.txt

My text file looks like this:

</td></tr><tr><td>2015-05-31 00:00:00</td><td>1.8136

</td></tr><tr><td>2015-05-31 00:01:00</td><td>1.8137

</td></tr><tr><td>2015-05-31 00:02:00</td><td>1.8136

</td></tr><tr><td>2015-05-31 00:03:00</td><td>1.8138

</td></tr><tr><td>2015-05-31 00:04:00</td><td>1.8136

.

I want to import it as two columns: the first a MATLAB datenum for each date and time, and the second the decimal number (1.8136 and so on).

How can I do that? Thanks.

The file is attached. As you can see, there is one observation for every minute of the day.

Comments: 4 (2 older comments hidden)

Guillaume on 14 Jan 2019

xml is a textual format, and so is html. From the snippet you show, the file clearly contains some sort of xml or html. Most likely it's html, since I'm not sure xml supports tables (which is probably what your snippet is). Note that html is not designed for data transfer; it's a presentation format, so I would recommend finding a more reliable way to obtain the data.

In any case, to really clarify what is in that file, please attach the full file.

Pepe on 14 Jan 2019

I've attached it. Thanks for the warning. Please take a look now.


Answer 1

Jan on 14 Jan 2019


https://kr.mathworks.com/matlabcentral/answers/439688-import-specific-type-of-text-file#answer_356338


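FileName = 'code=abas&period=30&endtime=2015-06-30.txt';  % the attached file
Str  = fileread(FileName);
% Mask the HTML tags: M becomes 1 from each '<' up to and including its '>'
indI = strfind(Str, '<');
indF = strfind(Str, '>');
M    = zeros(size(Str));
M(indI) =  1;
M(indF) = -1;
M       = cumsum(M);   % 1 inside a tag, 0 at the closing '>' and outside tags
M(indF) = 1;           % count the closing '>' as part of the tag as well
Str(M == 1) = ' ';     % replace every tag character with a blank
% Read the data (assumes only "date time number" triplets remain):
S      = textscan(Str, '%s %s %f');
Date   = datenum(strcat(S{1}, {' '}, S{2}), 'yyyy-mm-dd HH:MM:SS');
Number = S{3};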
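
For comparison, the tag masking can also be sketched with a regular expression instead of the cumsum mask. This is a different technique from the code above, not a correction of it. It runs on two lines copied from the question rather than on the attached file, and it assumes that only date, time and number remain once the tags are blanked, which may not hold for the full file (for example if it contains a header row).

```matlab
% Two sample lines copied from the question; the full file may contain more.
Str = sprintf(['</td></tr><tr><td>2015-05-31 00:00:00</td><td>1.8136\n', ...
               '</td></tr><tr><td>2015-05-31 00:01:00</td><td>1.8137\n']);

% Replace every "<...>" tag with a blank so textscan sees whitespace-separated fields.
Clean = regexprep(Str, '<[^>]*>', ' ');

% Each record is now "date time number".
S      = textscan(Clean, '%s %s %f');
Date   = datenum(strcat(S{1}, {' '}, S{2}), 'yyyy-mm-dd HH:MM:SS');
Number = S{3};
```

Both variants break if the data itself contains a '<' or '>' character, so they are only suitable for simple files like the snippet shown.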


Answer 2

Guillaume on 14 Jan 2019


https://kr.mathworks.com/matlabcentral/answers/439688-import-specific-type-of-text-file#answer_356346

Edited: Guillaume on 14 Jan 2019

Your text file is a portion of an html file. As I commented, html is not designed for data transfer, and you would be better off finding a better way to get your data. Typically, websites provide a proper method to access their source data (such as xml or json files).
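
To illustrate why a data format is easier to work with, here is a minimal sketch of reading the same kind of records from JSON. The payload, including its field names time and rad, is entirely hypothetical; nothing in this thread says the site offers such a file or what its layout would be.

```matlab
% Hypothetical JSON payload; the real site may offer a different layout or none at all.
payload = ['[{"time":"2015-05-31 00:00:00","rad":1.8136},' ...
           '{"time":"2015-05-31 00:01:00","rad":1.8137}]'];

s = jsondecode(payload);       % struct array with fields time and rad

timestr = {s.time};            % collect the timestamps as a cell array of char
radrow  = [s.rad];             % collect the values as a numeric row vector

Time = datetime(timestr(:), 'InputFormat', 'yyyy-MM-dd HH:mm:ss');
rad  = radrow(:);
data = table(Time, rad);
```

No tag stripping or regular expressions are needed here, because the structure is part of the format.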

The following will parse your file. However, it's not a proper html parser, so it may well fail on other files obtained the same way. Because html is a presentation format, it could contain extra content (such as text formatting) that would make the parsing fail. Again, html is not a suitable format for data transfer, and it would be nearly impossible to write a robust parser.

% Read the whole content of the file
filecontent = fileread('code=abas&period=30&endtime=2015-06-30.txt');
% Extract table rows. Does not allow for <tr> attributes (the regex takes too long otherwise)
rows = regexp(filecontent, '(?<=<tr>).*?(?=</tr>)', 'match');
% Extract the columns of each row. Allows for <td> attributes but nothing else
columns = regexp(rows, '(?<=<td[^>]*>).*?(?=</td>)', 'match');
% Will error if any table row has more or fewer columns than the others (allowed in html)
rawtable = vertcat(columns{:});
% Skip the first row and convert the remaining rows to a table
data = table(datetime(rawtable(2:end, 1)), str2double(rawtable(2:end, 2)), 'VariableNames', {'Time', 'rad'})
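
The question asked for a datenum column, while the code above returns a table with datetime values. A minimal sketch of the conversion follows. It builds a two-row stand-in table from the values in the question instead of reading the attached file, and the variable name result is only illustrative.

```matlab
% Two sample rows standing in for the parsed table (values from the question).
Time = datetime({'2015-05-31 00:00:00'; '2015-05-31 00:01:00'}, ...
                'InputFormat', 'yyyy-MM-dd HH:mm:ss');
rad  = [1.8136; 1.8137];
data = table(Time, rad);

% datenum accepts datetime arrays, so the conversion is a single call.
result = [datenum(data.Time), data.rad];   % N-by-2 numeric matrix
```

Keeping the datetime values is usually more convenient for plotting and time arithmetic; the numeric form is mainly useful for older code that expects datenum.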

Comments: 1

Guillaume on 17 Jan 2019

Note that my solution is a lot more robust than the accepted solution (which, by the way, does not work when I test it on the provided file) and produces a more modern output (a table with datetime values rather than datenum numbers).
