Observing Error while reading .csv files

Question

Mohaiminul 2024년 3월 5일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2090356-observing-error-while-reading-csv-files

편집: Stephen23 2024년 3월 16일

I am facing an issue of extracting data from the .csv files. The program will read the files from several folders inside a main folder. Sample repetitive contents of the csv files can be as below:

key operation statistics

session time (sec) 1

packet try count 18

packet succeed count 17

rx receptions 1

packet-1 success/failure count 18/0

Signal Strength of packet-1 -52.5

Signal Strength of packet-2 -53.5

key operation statistics

session time (sec) 5

packet try count 32

packet succeed count 31

rx receptions 1

packet-1 success/failure count 28/0

Signal Strength of packet-1 -59.5

Signal Strength of packet-2 -60

%The coloumn 1 has alphabetic description and coloumn 2 contains all corresponding data in each csv file.

%The global TX power value is set to -20 dBm. Otherwise, one csv_power value files are saved to detect the TX value such as 6.csv_power file. The code will understand then that 6 dB is the TX value.

My main program file is as below:

%Variables declared initially
MeasurementFolder = 'Name of the main folder where the CSV files are'; %matlab files are kept with the main folder
subFolders = GetSubDirsFirstLevelOnly(MeasurementFolder); %quite not sure this line can perfectly read all the files inside a sub folder
if exist([MeasurementFolder, '/allMeasurements.mat'], 'file')
    load ([MeasurementFolder, '/allMeasurements.mat'])
else
    allMeasurements = struct();                   % make a struct to contain everything
    for s = 1:length(subFolders)
        allMeasurements(s).data = struct();
        allMeasurements(s).name = [MeasurementFolder, '/', subFolders{s}];
        allMeasurements(s).inputCSVFileNames = char(dir([allMeasurements(s).name, '/*.csv']).name);
        allMeasurements(s).TX_power = str2double(extractBefore(string(dir([allMeasurements(s).name, '/*.csv_power']).name), '.csv_power')); 
        if isnan(allMeasurements(s).TX_power) 
            allMeasurements(s).TX_power = TX_power; % saving global value (from variables) if no specific one
        end
    end
    for n = 1:size({allMeasurements.name},2)
        allMeasurements(n).name;
        allMeasurements(n).data = extractInfoFromCSV(allMeasurements(n).data, allMeasurements(n).inputCSVFileNames, allMeasurements(n).name);
    end
    for n = 1:size({allMeasurements.name},2)
        allMeasurements = plotSS(allMeasurements, n, figureDirection, prctileLow, prctileHigh, filteringParameter);
    end
    %(the code continues)

The function which reads the info from the files:

function [outputStruct] = extractInfoFromCSV(inputStruct, inputCSVFileNames, inputFolder)
    outputStruct = inputStruct; 
    keyline_stats = "key operation statistics";
    keyline_session_time = "    session time (sec)";
    keyline_packet_try_count = "    packet try count         ";
    keyline_packet_succeed_count = "    packet succeed count     ";
    keyline_packet1_sf = "    packet-1 success/failure count ";
    keyline_packet2_sf = "    packet-2 success/failure count ";
    keyline_packet1_strength = "    Signal Strength of packet-1          ";
    keyline_packet2_strength = "    Signal Strength of packet-2          ";
    varTypes = {'double','double','double','double','double','double','double','double','double'};
    varNames = {'session time (sec)','packet try count','packet succeed count','packet-1 success count','packet-1 failure count','packet-2 success count','packet-2 failure count','Signal Strength of packet-1','Signal Strength of packet-2'};
% from packet-1 success/failure count and packet-2 success/failure count, we will separate the success and fail count which are divided by /. then the values will be stored in the separate coloumns as ok and nok count. In each csv files, the success and failure count is the cumulative value of the previous counts. So the last value of the packet-1 success/failure count will only be stored, and all the previous values of the same csv file will be ignored.
    for n = 1:size(inputCSVFileNames,1) 
        lines = readlines([inputFolder, '/',inputCSVFileNames(n,:)]); 
        T_temp = readtable([inputFolder, '/', inputCSVFileNames(n,:)], 'Delimiter', ',');
        for ci =1:numel(lines) 
            if strcmp(keyline_stats, (lines(ci)))  
                T_temp.("session time (sec)")(ci+1)   =  str2double(extractAfter(lines(ci+1),keyline_session_time));     
                T_temp.("packet try count")(ci+1)   =  str2double(extractAfter(lines(ci+2),keyline_packet_try_count));          
                T_temp.("packet succeed count")(ci+1)   =  str2double(extractAfter(lines(ci+3),keyline_packet_succeed_count));  
                T_temp.("packet-1 success count")(ci+1)   =  str2double(extractBetween(lines(ci+5),keyline_packet1_sf, "/"));   
                T_temp.("packet-1 failure count")(ci+1)   =  str2double(extractAfter(extractAfter(lines(ci+5),"/"),"/"));    
                T_temp.("packet-2 success count")(ci+1)   =  str2double(extractBetween(lines(ci+6),keyline_packet2_sf, "/"));    
                T_temp.("packet-2 failure count")(ci+1)   =  str2double(extractAfter(extractAfter(lines(ci+6),"/"),"/"));    
                T_temp.("Signal Strength of packet-1")(ci+1)   =  str2double(extractAfter(lines(ci+7),keyline_packet1_strength));              
                T_temp.("Signal Strength of packet-2")(ci+1)   =  str2double(extractAfter(lines(ci+8),keyline_packet2_strength));              
            end
            T_filtered = T_temp(T_temp.("session time (sec)")~=0,:); 
        end
        outputStruct(n).table = T_filtered;
        outputStruct(n).name = strcat('distance_', extractBefore(string(inputCSVFileNames(n,:)), '.')); 
        outputStruct(n).distances = str2double(extractBefore(string(inputCSVFileNames(n,:)), 'm'));
        outputStruct(n).packet1_success_last = T_filtered.("packet-1 success count")(end); 
        outputStruct(n).packet1_failure_last = T_filtered.("packet-1 failure count")(end); 
        outputStruct(n).packet2_success_last = T_filtered.("packet-2 success count")(end); % save distances from file names
    	outputStruct(n).packet2_failure_last = T_filtered.("packet-2 success count")(end); % save distances from file names
    end
    end

There are several other functions which are not being shared here for the ease of the replier.

Error observed: Unrecognized field name "name".

Error in main_file (line 45)

for n = 1:size({allMeasurements.name},2)

What can I do to eliminate the error?

댓글 수: 4
이전 댓글 2개 표시이전 댓글 2개 숨기기

Stephen23 2024년 3월 5일

@Mohaiminul: please upload a sample data file by clicking the paperclip button.

Mohaiminul 2024년 3월 5일

500m.csv

the sample data file is attached.

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Stephen23 2024년 3월 15일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2090356-observing-error-while-reading-csv-files#answer_1426101

편집: Stephen23 2024년 3월 16일

MATLAB Online에서 열기

500m.csv

Your code is fragile and complex. You should aim to simplify it and make it more generalised:

avoid having lots of hard-coded text values,
avoid cramming too much onto one line of code,
avoid CHAR() in lists of filenames,
use FULLFILE instead of concatenating paths with hard-coded path separators.

For example:

P = '.'; % absolute or relative path to where the file is saved
F = '500m.csv';
C = readcell(fullfile(P,F), 'Delimiter',',')
C = 54×2 cell array
    {'key operation statistics'      }    {[<missing>]}
    {'session time (sec)'            }    {[        1]}
    {'packet try count'              }    {[       18]}
    {'packet succeed count'          }    {[       17]}
    {'rx receptions'                 }    {[        1]}
    {'packet-1 success/failure count'}    {'18/0'     }
    {'packet-2 success/failure count'}    {'18/0'     }
    {'Signal Strength of packet-1'   }    {[ -52.5000]}
    {'Signal Strength of packet-2'   }    {[ -53.5000]}
    {'key operation statistics'      }    {[<missing>]}
    {'session time (sec)'            }    {[        2]}
    {'packet try count'              }    {[       38]}
    {'packet succeed count'          }    {[       37]}
    {'rx receptions'                 }    {[        1]}
    {'packet-1 success/failure count'}    {'36/2'     }
    {'packet-2 success/failure count'}    {'36/0'     }
X = startsWith(C(:,1),'key');
Y = cellfun(@ischar,C(:,2));
T = regexp(C(Y,2),'(\d+)/(\d+)','tokens','once');
C(Y,2) = num2cell(str2double(vertcat(T{:})),2);
T = cell2table(C);
T.Group = cumsum(X);
U = unstack(T,'C2','C1', 'VariableNamingRule','preserve');
U = convertvars(U,@iscell,@(c)vertcat(c{:}))
U = 6×10 table
    Group    Signal Strength of packet-1    Signal Strength of packet-2    key operation statistics    packet succeed count    packet try count    packet-1 success/failure count    packet-2 success/failure count    rx receptions    session time (sec)
    _____    ___________________________    ___________________________    ________________________    ____________________    ________________    ______________________________    ______________________________    _____________    __________________

      1                 -52.5                          -53.5                      <missing>                     17                    18                      18      0                         18      0                    1                  1         
      2                   -49                          -50.5                      <missing>                     37                    38                      36      2                         36      0                    1                  2         
      3                   -50                          -51.5                      <missing>                     57                    58                      56      2                         55      1                    1                  3         
      4                   -51                          -52.5                      <missing>                     77                    78                      76      2                         75      1                    1                  4         
      5                 -49.5                            -51                      <missing>                     97                    98                      96      2                         90      6                    1                  5         
      6                 -48.5                            -50                      <missing>                    117                   118                     116      2                        106     10                    1                  6         

Note that you can easily split the success/failure matrices into column vectors:

U = splitvars(U)
U = 6×12 table
    Group    Signal Strength of packet-1    Signal Strength of packet-2    key operation statistics    packet succeed count    packet try count    packet-1 success/failure count_1    packet-1 success/failure count_2    packet-2 success/failure count_1    packet-2 success/failure count_2    rx receptions    session time (sec)
    _____    ___________________________    ___________________________    ________________________    ____________________    ________________    ________________________________    ________________________________    ________________________________    ________________________________    _____________    __________________

      1                 -52.5                          -53.5                      <missing>                     17                    18                          18                                  0                                   18                                   0                         1                  1         
      2                   -49                          -50.5                      <missing>                     37                    38                          36                                  2                                   36                                   0                         1                  2         
      3                   -50                          -51.5                      <missing>                     57                    58                          56                                  2                                   55                                   1                         1                  3         
      4                   -51                          -52.5                      <missing>                     77                    78                          76                                  2                                   75                                   1                         1                  4         
      5                 -49.5                            -51                      <missing>                     97                    98                          96                                  2                                   90                                   6                         1                  5         
      6                 -48.5                            -50                      <missing>                    117                   118                         116                                  2                                  106                                  10                         1                  6         

And also give those columns better names:

V = U.Properties.VariableNames;
H = @(idx,varargin) varargin{str2double(idx)};
V = regexprep(V,'(\w+)/(\w+)\s*(\w+)_(\d+)$','${H($4,$1,$2)} $3');
U.Properties.VariableNames = V
U = 6×12 table
    Group    Signal Strength of packet-1    Signal Strength of packet-2    key operation statistics    packet succeed count    packet try count    packet-1 success count    packet-1 failure count    packet-2 success count    packet-2 failure count    rx receptions    session time (sec)
    _____    ___________________________    ___________________________    ________________________    ____________________    ________________    ______________________    ______________________    ______________________    ______________________    _____________    __________________

      1                 -52.5                          -53.5                      <missing>                     17                    18                     18                        0                         18                         0                    1                  1         
      2                   -49                          -50.5                      <missing>                     37                    38                     36                        2                         36                         0                    1                  2         
      3                   -50                          -51.5                      <missing>                     57                    58                     56                        2                         55                         1                    1                  3         
      4                   -51                          -52.5                      <missing>                     77                    78                     76                        2                         75                         1                    1                  4         
      5                 -49.5                            -51                      <missing>                     97                    98                     96                        2                         90                         6                    1                  5         
      6                 -48.5                            -50                      <missing>                    117                   118                    116                        2                        106                        10                    1                  6         

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Answer 2

Pratyush Swain 2024년 3월 14일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2090356-observing-error-while-reading-csv-files#answer_1425586

편집: Pratyush Swain 2024년 3월 15일

MATLAB Online에서 열기

Hi Mohaiminul,

Please ensure the function "GetSubDirsFirstLevelOnly" function correctly retrieves the names of the first-level subdirectories as you might be retreving empty subfolders due to which the "allMeasurements" struct will not contain any "data" field and hence produce the error.

You can follow the given implementation:

function [subDirsNames] = GetSubDirsFirstLevelOnly(parentDir)
    % Get a list of all files and folders in this folder.
    files    = dir(parentDir);
    names    = {files.name};
    % Get a logical vector that tells which is a directory.
    dirFlags = [files.isdir] & ...
              ~strcmp(names, '.') && ~strcmp(names, '..');
    % Extract only those that are directories.
    subDirsNames = names(dirFlags);
end

Also you can specify error handling mechanisms like a check to ensure "allMeasurements" is not empty before entering the loop.

if ~isempty(allMeasurements)
    for n = 1:size({allMeasurements.name},2)
        % Your loop implementation
    end
else
    warning('No measurements found.');
end

For more information, please go through the following references:

1- https://www.mathworks.com/matlabcentral/answers/166629-is-there-any-way-to-list-all-folders-only-in-the-level-directly-below-a-selected-directory#comment_624696

Hope this helps.

EDIT: Previous answer contained "GetSubDirsFirstLevelOnly" function implementation which is not a general solution and should be avoided, it has been corrected now.