
Merging table rows, keep all columns

44 views (last 30 days)
Marc Elpel on 13 Nov 2019
Commented: Marc Elpel on 14 Nov 2019
I'm trying to combine data from multiple tables into one. (data files attached). Seems like a simple join(), or outerjoin(), but every path has run into issues.
Specifically what I want to do:
  1. Add rows from table 2 to table 1.
  2. Keep all rows in both tables (append rows)
  3. Where column names match, use that column
  4. Where columns are new, add column to table width
  5. Keep column names (outer join is renaming based on source table)
  6. Some table values are empty and should combine as empty values in existing and/or new columns as needed.
Tried so far:
  1. join() - fails due to some empty values
  2. join() with NaNs replaced - fails due to some other key value error
  3. outerjoin() with multiple configuration options - all failed.
  4. innerjoin() - does not seem like what I want (throwing out data).
When done combining the attached tables, the result should have slightly more columns than the first table, and the row count should be the sum of the rows in both tables.
This should be a common issue so assuming I am missing some simple solution...?
Using Matlab 2016b
Marc
  6 Comments
Adam Danz on 13 Nov 2019
Edited: Adam Danz on 13 Nov 2019
I've read-in your tables and the column names match between both tables. Points 3 and 4 in your question (thanks for the numbering - that makes this easy to discuss) mention column names that do not match. Are there supposed to be column names that do not match?
I should add that upon reading in your table, Matlab had to modify some of the column names to conform to Matlab syntax.
Warning: Column headers from the file were modified to make them valid MATLAB identifiers before creating variable names for the table.
The original column headers are saved in the VariableDescriptions property.
Set 'PreserveVariableNames' to true to use the original column headers as table variable names.
files = {'RESULTS_SAMP1.CSV', 'RESULTS_SAMP2.CSV'}; %Full paths are always better
T1 = readtable(files{1},'Delimiter',',');
T2 = readtable(files{2},'Delimiter',',');
% Do column names match?
all(ismember(T1.Properties.VariableNames, T2.Properties.VariableNames)) % Yes
all(ismember(T2.Properties.VariableNames, T1.Properties.VariableNames)) % Yes
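When the names do not all match, setdiff and intersect show exactly which columns differ. This is a minimal sketch on two small made-up tables; the tables and the column names A, B and C are hypothetical and not taken from the attached files.

```matlab
% Two toy tables that share column A but differ in B and C (hypothetical data).
T1 = table([1;2], [3;4], 'VariableNames', {'A','B'});
T2 = table([5;6], [7;8], 'VariableNames', {'A','C'});

% Columns present in only one of the tables.
onlyInT1 = setdiff(T1.Properties.VariableNames, T2.Properties.VariableNames);
onlyInT2 = setdiff(T2.Properties.VariableNames, T1.Properties.VariableNames);

% Columns present in both tables.
shared = intersect(T1.Properties.VariableNames, T2.Properties.VariableNames);

disp(onlyInT1); disp(onlyInT2); disp(shared);
```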
Marc Elpel on 13 Nov 2019
Tried fixing names first with 'PreserveVariableNames', but this did not work. "No public property PreserveVariableNames exists for class matlab.io.text.DelimitedTextImportOptions." Lesser issue compared to others.
I randomly selected two files and they were giving me merging errors, so I thought those had different columns. Some of my data DOES include differences; we can simulate that by deleting the 3rd column in the first table and the 5th column in the second table (it does not matter which we delete, just making them different). What join command will combine these tables, keeping all rows and adding columns as needed to match the data? In some cases there will be missing columns, which should be filled with empty cells.
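On the variable-name side issue: the warning quoted earlier says the original headers are kept in the VariableDescriptions property, which does not require the PreserveVariableNames option. A rough sketch of reading them back, assuming a small made-up CSV (the file name and its contents are hypothetical):

```matlab
% Write a tiny CSV whose headers are not valid MATLAB identifiers
% (file name and contents are made up for illustration).
fname = [tempname '.csv'];
fid = fopen(fname, 'w');
fprintf(fid, 'Specimen Type,Test #\n');
fprintf(fid, 'steel,1\n');
fprintf(fid, 'steel,2\n');
fclose(fid);

% readtable modifies the headers into valid identifiers (and may warn).
T = readtable(fname, 'Delimiter', ',', 'ReadVariableNames', true);
validNames = T.Properties.VariableNames;

% The original header text is stored here, per the warning message.
origInfo = T.Properties.VariableDescriptions;
disp(validNames);
disp(origInfo);

delete(fname);   % clean up the temporary file
```

The exact text stored in VariableDescriptions may differ between releases, so it is best inspected rather than parsed.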

Accepted Answer

Adam Danz on 13 Nov 2019
Edited: Adam Danz on 13 Nov 2019
% Read in the data
files = {'RESULTS_SAMP1.CSV', 'RESULTS_SAMP2.CSV'}; %Full paths are always better
T1 = readtable(files{1},'Delimiter',',');
T2 = readtable(files{2},'Delimiter',',');
% Simulate column-mismatch
T1 = removevars(T1,'SpecimenType'); % remove col 3
T2 = removevars(T2,'Test'); % remove col 5
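% Combine the tables: by default all shared columns act as keys and are merged
% under their original names; columns found in only one table are kept and
% filled with missing values for rows from the other table
T3 = outerjoin(T1,T2,'MergeKeys', true)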
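The same call can be tried on toy data without the CSV files. The sketch below uses made-up tables (all values are hypothetical) and deletes the columns with T.Var = [] instead of removevars, because removevars was introduced in R2018a and is not available in R2016b.

```matlab
% Toy tables standing in for the two CSV files (hypothetical data).
names = {'ID','SpecimenType','Test','Load'};
T1 = table({'a';'b'}, {'steel';'steel'}, [1;2], [10;20], 'VariableNames', names);
T2 = table({'c';'d'}, {'alu';'alu'},     [3;4], [30;40], 'VariableNames', names);

% Simulate the column mismatch in a way that also works in R2016b.
T1.SpecimenType = [];   % T1 now has ID, Test, Load
T2.Test = [];           % T2 now has ID, SpecimenType, Load

% Shared columns (ID, Load) are used as keys and merged; the other
% columns are kept and padded with NaN or '' where a table lacks them.
T3 = outerjoin(T1, T2, 'MergeKeys', true);
disp(T3);
```

Two caveats with this approach: the output rows are sorted by the key values rather than kept in file order, and a row of T1 that matches a row of T2 on every shared column is joined into a single row, so the row count equals the sum of both tables only when no such matches exist.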
  4 Comments
Adam Danz on 13 Nov 2019
Glad I could help out.
Just so I understand, the problem you're describing isn't with the merging of tables, it's with importing the tables. Is that correct?
Have you tried importing the tables without using the PreserveVariableNames flag?
Could you attach one of the files causing problems?
Marc Elpel on 14 Nov 2019
The problem is I need to sanitize the data for posting, and as soon as I make any change and save the file, it works. There is something hidden in the original CSV files which is corrupting the import. Unfortunately I cannot upload these files without modification.
I think I tried the PreserveVariableNames flag, which was unknown in 2016b. Not using it now.
I'm going to close the thread - thanks for your help!
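The thread never identifies what was hidden in the original CSV files. One way to look for such content without opening the file in an editor is to inspect the raw bytes. The sketch below is only a guess at a possible cause: it writes a made-up file that starts with a UTF-8 byte order mark, then flags that mark and any byte outside printable ASCII. The file and its contents are hypothetical.

```matlab
% Create a sample CSV that starts with a UTF-8 byte order mark (hypothetical file).
fname = [tempname '.csv'];
fid = fopen(fname, 'w');
fwrite(fid, [uint8([239 187 191]) uint8('ID,Load') uint8([13 10]) ...
             uint8('a,10') uint8([13 10])], 'uint8');
fclose(fid);

% Read the file back as raw bytes, bypassing any text decoding.
fid = fopen(fname, 'r');
bytes = fread(fid, Inf, 'uint8=>uint8')';
fclose(fid);

% Check for a leading UTF-8 byte order mark (EF BB BF).
hasBOM = numel(bytes) >= 3 && isequal(bytes(1:3), uint8([239 187 191]));

% Flag bytes outside printable ASCII, ignoring tab, line feed and carriage return.
isOdd = (bytes < 32 & ~ismember(bytes, uint8([9 10 13]))) | bytes > 126;
oddPos = find(isOdd);

fprintf('BOM present: %d, suspicious bytes at positions: %s\n', hasBOM, mat2str(oddPos));
delete(fname);   % clean up the temporary file
```

Pointing fname at one of the original files (and skipping the part that writes the sample) would list the positions worth checking.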

More Answers (0)
