How to calcuate sum of each row for a dataset through MapReduce??

조회 수: 5 (최근 30일)
RJ
RJ 2017년 6월 7일
답변: Raikunta Aruna 2022년 1월 26일
I am working on a data clustering algorithm, I want row sum for each data set ID. I want the answer in the form of key, value.
  댓글 수: 3
Guillaume
Guillaume 2017년 6월 8일
편집: Guillaume 2017년 6월 8일
@Rahul,
Please provide an example of your dataset (with explanation of what each column is, particularly which one is the ID), and explain what you mean by row sum.
How big is your dataset that you need to use mapreduce?
RJ
RJ 2017년 6월 9일
Dataset contains 10K rows and 5 columns, first one is ID. So I want column sum for five columns and rowsum for each row in 10K rows. Thank you for your earlier answer.

댓글을 달려면 로그인하십시오.

답변 (2개)

A. P. B.
A. P. B. 2017년 6월 8일
If you have an N by M matrix; for P number of datasets
1) Read each data set
2)Calculate the sum and the val=sum(data(i,:));
But this gives only the value. Did not get what 'key' means in your query?
Usually 'min' or 'max' function which returns the minimum value or maximum value along a row returns the index value (column index) not the sum function.
  댓글 수: 1
RJ
RJ 2017년 6월 9일
So how to perform sum of columns for a particular row and here Key is ID associated with each data series.

댓글을 달려면 로그인하십시오.


Raikunta Aruna
Raikunta Aruna 2022년 1월 26일
How to find sum of row in a datset?

카테고리

Help CenterFile Exchange에서 Large Files and Big Data에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by