With PCA, how much of the photo did i compress?

Question

ali yaman 2022년 7월 15일

1
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1761175-with-pca-how-much-of-the-photo-did-i-compress

댓글: William Rose 2022년 8월 6일

채택된 답변: William Rose

ben.png

MATLAB Online에서 열기

Hi all,

Here my own PCA algorithm code that i create by myself by inspiring Prof. Andrew Ng's ML lectures.

It works very well.

The problem that I do not know how much of the original photo i compressed and how to press it on the title of the photo.

In title i pressen the number of colums for original and number of principal component for reversed (reconstracted) photo, is that a true way to specify how much i compressed?

SO, could you please check and edit the title(sprintf('....')) parts?

clear

close all

clc

a=imread('ben.png');

b=a(:,:,2);

X=double(b)/255;

% imshow(X)=imshow(b) ile aynıdır

% imagesc(X)= imagesc(b) ikisi renkli

[U,S,Xn]=pca(X);

% K = 20; % or find K with below algorithm

ss=sum(sum(S));

for K=1:size(X,2)

ss2(K)=S(K,K);

if sum(ss2)/ss >=0.99

break

end

Z = projectData(Xn, U, K);

Xappx= recoverData(Z, U, K);

figure

subplot(1, 2, 1);

imshow(b)

title(sprintf('Original: %d features', size(X,2)));

axis square;

subplot(1, 2, 2);

imshow(Xappx)

title(sprintf('Recovered: with top %d principal component', K));

axis square;

function [U, S, X] = pca(X)

% U = zeros(n);

% S = zeros(n);

m=size(X,1);

sigma = (1/m)*(X'*X);

[U, S , ~] = svd(sigma);

end

function Z = projectData(X, U, K)

% Z = zeros(size(X, 1), K);

U_reduce = U(:,(1:K)); % n x K

Z = X * U_reduce; % m x k

end

function X_rec = recoverData(Z, U, K)

% X_rec = zeros(size(Z, 1), size(U, 1));

% m * n

X_rec = Z * U(:,1:K)'; %=m*n

end

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

William Rose 2022년 7월 15일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1761175-with-pca-how-much-of-the-photo-did-i-compress#answer_1008505

@ali yaman,

The percent compression is (Au-Ac)/Au, where Au= the amount of information needed to generate the uncompressed image, and Ac= the amount of information needed to generate the compressed image.

Au: Image ben.png is 1408x1849 pixels. The monochrome image (for example, the green channel of the image, which you chose) has 1 byte per pixel, as represented in Matab. Therefore Au=2,603,392 bytes. (A monochrome PNG file on disk will probably be smaller, because PNG uses a lossless compression algorithm.)

Ac: To reconstruct an image that has been compressed with PCA, you need the basis set images and the weighting factors which tell you how much of each basis image to use in the reconstruction. Therefore PCA actually requires MORE infrmation that the original image, if you are only compressing one image: You have to supply each of the basis images, plus the weighting factors. If you are compressing a large set of images, then PCA can produce good compression, because you use a common basis set for all the images, and a small set of wegihting factors for each images. For example, supose you had 1000 images with the same size as "ben". The raw monocrome images require 1408x1849x1000 bytes, i.e. Au=2.603 x 10^9 bytes. If you recontsruct the images using the first 20 principal components, you would need 1408x1849x20 for the basis images, plus 20x1000 for the weighting factors, i.e Ac=5.209 x 10^6 bytes.

The compression percentage, if you had 1000 images and reconstructed them using 20 principal components (i.e. a basis set of 20 images), would be (Au-Ac)/Au= 98%. The compression ratio for one image with PCA is a negative number, bcause you need more informatin to reconstruct it with PCA than the original image.

See this article for more.

For a set of RGB images, you can do PCA on each color independently.

댓글 수: 2
없음 표시없음 숨기기

ali yaman 2022년 7월 16일

Hi @William Rose,

Thank you for your comprehensive answer. I think I got it.

So, I should not use PCA in just one data (only one photo) but I use in many data, right?

Also, we can not determine whether sucseccfully i made compression or not, because it is just one photo and thats why compression ratio is negative, right?

Thanks.

William Rose 2022년 7월 18일

편집: William Rose 2022년 7월 19일

MATLAB Online에서 열기

@ali yaman,

[edited: I should have said "row" in csome places where I said "column", and I added text to clarify.]

The description I referenced for PCA on a set of images is one approach, but it is not the only approach. When you do PCA on a set of images, as described in the website I cited, each pricipal component is itself an image. Each individual image in the library of compressed images is then reconstructed by adding varying amounts of the different principal component images. That method only makes sense to use if you have a library of images. It workes best when the images have some common features, such as a set of faces.

A different approach to PCA, which works for a single image, is to treat each the image matrix as a set of row vectors, and then find the principal components (PCs) for the matrix.

[coeff,score,~] = pca(double(img),'Centered',false);

img is the original monochrome image (a 2D array). coeff is the matrix of principal components. Each column of coeff is one principal component. score is the matrix of weighting factors. Each row of score is the weights needed to reconstrutct the corresponding row of img. I use double(img) to convert the values in img from unsigned integers to floating point numbers, as required by pca(). I use 'centered','false to prevent the subtraction of the mean value from each column.

Reconstruct the image adding varying amounts of the different principal component vectors. To reconstruct the image using the first 10 PCs, do this:

imgRC=uint8(score(:,1:10)*coeff(:,1:10)');

imgRC is the reconstructed monochrome image (a 2D array). I use uint8() to convert the floating point values to unsigned 8-bit integers.

This can work reasonably well for a single image. For a color image, split the color image into 3 monochrome images, and do PCA on each of them, as described above, then combine the 3 mono images to get the reconstructed color image. See code below:

img=imread(imagefile);
imgr=img(:,:,1);                %red original
imgg=img(:,:,2);                %green original
imgb=img(:,:,3);                %blue original

Then do PCA on imgr, imgg, and imgb separately, as described above. You will get three reconstructed images. Suppose they are named imgRCr, imgRCg, and imgRCb. Then you create the color reconstructed image as follows:

imgRC=cat(3,imgr,imgg,imgb);    %create color image (3D array)
imshow(imgRC);                  %display reconstructed image

The percent compression can be measured in different ways.

You could save the reconstructed image as a file, and compare its file size to the file size of the orignal image. You may see no compression (for example, if the images are both .bmp), or you may find some compression (for example if the images are both .jpg or if they are both .png). The exact amount of compression is hard to predict since JPEG and PNG have their own built-in compression algorithms. When I tried it with ben.jpg and reconstructed with 20 PCs, I got .
You could compare the number of numbers needed to represent the original and reconstructed images. . The number of numbers needed to represent the orignal monochrome image is . If you reconstruct with 20 PCs, you need 20*c numbers for the PCs, and you need 20*r numbers for the weighting factors, so . Then the compression for 20 PCs is . Image ben.jpg has 1849 rows x 1408 columns, therefore the compression with 20 PCs, measured as the ratio of numbers, is .

댓글을 달려면 로그인하십시오.

Answer 2

William Rose 2022년 7월 19일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1761175-with-pca-how-much-of-the-photo-did-i-compress#answer_1010475

MATLAB Online에서 열기

@ali yaman, The attached script applies PCA image compression to image ben.jpg. It reconstructs the image with 10 principal components and with 20 PCs. The compressed files are saved as ben10.jpg and ben20.jpg. The red, blue, and green components are compressed separately and are combined after compression.

From the comments in the script:

%Demonstrate the use of PCA for color image compression.
%An image is read from disk.  It is split into R, G, B components.
%Each color is compressed with PCA.  The compressed color slices are combined
%to reconstruct color images. Reconstructed color images are saved to disk.
%The color and R,G,B channels of the original and reconstructed images
%are displayed in low resolution as an array of images.  
%To compress a different file, change the value of imagefile.
%To compress with different numbers of PCs, change the value of numpc,
%for example, numpc=15 or [10,20] or [6,12,24] or [10,20,50,1000].
%To see the true effects of compression, the user should view the original
%and reconstructed images at full resolution.

Good luck.

댓글 수: 2
없음 표시없음 숨기기

ali yaman 2022년 7월 29일

@William Rose Thanks for your enormous endeavour. It works perfectly. And, I learned a lot new things when a look at your codes.

William Rose 2022년 7월 29일

You are welcome, Ali.

댓글을 달려면 로그인하십시오.

Answer 3

ali yaman 2022년 7월 15일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1761175-with-pca-how-much-of-the-photo-did-i-compress#answer_1008485

By the way is it possible to compress my original RGB photo which is asigned to variable a, without reduce it to just green colur( variable b) ?

댓글 수: 3
이전 댓글 1개 표시이전 댓글 1개 숨기기

William Rose 2022년 7월 29일

편집: William Rose 2022년 7월 29일

MATLAB Online에서 열기

ben.jpg

@ali yaman,

Yes, you can compress the original color image with PCA to make a color image. The script below does this. The original color image is called img, but you could call it a. The compressed color image is called imgRC, and you could call it b.

clear

numpc=20; %numbers of PCs to use for reconstruction

%% Read original image, extract colors, allocate arrays

img=imread('ben.jpg');

imgr=img(:,:,1); %red original

imgg=img(:,:,2); %green original

imgb=img(:,:,3); %blue original

[r,c]=size(imgr); %rows, columns in image

%% Do PCA on red

[coeffr,scorer,~] = pca(double(imgr),'Centered',false);

imgRCr=uint8(scorer(:,1:numpc)*coeffr(:,1:numpc)');

%% Do PCA on green

[coeffg,scoreg,~] = pca(double(imgg),'Centered',false);

imgRCg=uint8(scoreg(:,1:numpc)*coeffg(:,1:numpc)');

%% Do PCA on blue

[coeffb,scoreb,~] = pca(double(imgb),'Centered',false);

imgRCb=uint8(scoreb(:,1:numpc)*coeffb(:,1:numpc)');

%% Assemble color image

imgRC=cat(3,imgRCr,imgRCg,imgRCb);

%% Display images

figure;

subplot(2,4,1); imshow(img); ylabel('Original')

subplot(2,4,2); imshow(imgr); title('Red')

subplot(2,4,3); imshow(imgg); title('Green')

subplot(2,4,4); imshow(imgb); title('Blue')

subplot(2,4,5); imshow(imgRC); ylabel([num2str(numpc),' PCs'])

subplot(2,4,6); imshow(imgRCr);

subplot(2,4,7); imshow(imgRCg);

subplot(2,4,8); imshow(imgRCb);

Try the above.

ali yaman 2022년 8월 6일

It works perfectly, thanks a lot @William Rose <3

William Rose 2022년 8월 6일

You're welcome, @ali yaman.

댓글을 달려면 로그인하십시오.

With PCA, how much of the photo did i compress?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 2
없음 표시없음 숨기기

추가 답변 (2개)

댓글 수: 2
없음 표시없음 숨기기

댓글 수: 3
이전 댓글 1개 표시이전 댓글 1개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

With PCA, how much of the photo did i compress?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 2 없음 표시없음 숨기기

추가 답변 (2개)

댓글 수: 2 없음 표시없음 숨기기

댓글 수: 3 이전 댓글 1개 표시이전 댓글 1개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 2
없음 표시없음 숨기기

댓글 수: 2
없음 표시없음 숨기기

댓글 수: 3
이전 댓글 1개 표시이전 댓글 1개 숨기기