The Glowing Python: PCA and image compression with numpy

Wednesday, July 27, 2011

PCA and image compression with numpy

In the previous post we saw the princomp function. This function performs principal component analysis (PCA) on an n-by-p data matrix and uses all p principal components to compute the principal component scores. In this post, we will see a modified version of princomp in which the representation of the original data in the principal component space is computed with fewer than p principal components:

from numpy import mean,cov,cumsum,dot,linalg,size,flipud,argsort

def princomp(A,numpc=0):
 # computing eigenvalues and eigenvectors of covariance matrix
 M = (A-mean(A.T,axis=1)).T # subtract the mean (along columns)
 [latent,coeff] = linalg.eigh(cov(M)) # eigh: the covariance matrix is symmetric, so the results stay real
 p = size(coeff,axis=1)
 idx = argsort(latent) # sorting the eigenvalues
 idx = idx[::-1]       # in descending order
 # sorting eigenvectors according to the sorted eigenvalues
 coeff = coeff[:,idx]
 latent = latent[idx] # sorting eigenvalues
 if numpc < p and numpc >= 0:
  coeff = coeff[:,:numpc] # cutting some PCs if needed
 score = dot(coeff.T,M) # projection of the data in the new space
 return coeff,score,latent
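
As a quick check of what the function returns, here is a minimal sketch that applies the same steps to synthetic data. The helper name princomp_sketch, the random data and the use of numpy's eigh are assumptions made for illustration; they are not part of the original post.

```python
import numpy as np

def princomp_sketch(A, numpc=0):
    """Hypothetical helper that mirrors the steps of princomp above."""
    M = (A - A.mean(axis=0)).T                  # p-by-n, each variable centered
    latent, coeff = np.linalg.eigh(np.cov(M))   # symmetric input, real output
    idx = np.argsort(latent)[::-1]              # largest eigenvalue first
    latent, coeff = latent[idx], coeff[:, idx]
    p = coeff.shape[1]
    if 0 <= numpc < p:
        coeff = coeff[:, :numpc]                # keep the first numpc components
    score = coeff.T @ M                         # numpc-by-n scores
    return coeff, score, latent

rng = np.random.default_rng(0)                  # synthetic data, illustration only
A = rng.normal(size=(100, 5))                   # n = 100 observations, p = 5 variables
coeff, score, latent = princomp_sketch(A, numpc=2)
print(coeff.shape, score.shape, latent.shape)   # (5, 2) (2, 100) (5,)
```

With numpc=2 only two of the five eigenvectors are kept, so every observation is described by two scores instead of five, while latent still holds all five eigenvalues.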

The following code uses the new version of princomp to compute the PCA of a matrix that represents a grayscale image. The PCA is computed repeatedly with an increasing number of principal components, in steps of ten. The script shows the images reconstructed using 50 or fewer principal components (out of 200).

from pylab import imread,subplot,imshow,title,gray,figure,show,NullLocator
A = imread('shakira.jpg') # load an image
A = mean(A,2) # to get a 2-D array
full_pc = size(A,axis=1) # numbers of all the principal components
i = 1
dist = []
for numpc in range(0,full_pc+10,10): # 0 10 20 ... full_pc
 coeff, score, latent = princomp(A,numpc)
 Ar = dot(coeff,score).T+mean(A,axis=0) # image reconstruction
 # difference in Frobenius norm
 dist.append(linalg.norm(A-Ar,'fro'))
 # showing the pics reconstructed with 50 PCs or fewer
 if numpc <= 50:
  ax = subplot(2,3,i,frame_on=False)
  ax.xaxis.set_major_locator(NullLocator()) # remove ticks
  ax.yaxis.set_major_locator(NullLocator())
  i += 1 
  imshow(flipud(Ar))
  title('PCs # '+str(numpc))
  gray()

figure()
imshow(flipud(A))
title('numpc FULL')
gray()
show()

The resulting images:

We can see that 40 principal components (out of 200) are already enough for a reconstruction that is visually close to the original image.
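
The reconstruction step can also be tried without an image file. The sketch below is only an illustration under stated assumptions: the helper reconstruct and the synthetic low-rank matrix standing in for a grayscale image are hypothetical. It rebuilds the data as coeff times score plus the column means, as in the script above, and records the Frobenius distance for a few values of k.

```python
import numpy as np

def reconstruct(A, k):
    """Hypothetical helper: rebuild A from its first k principal components."""
    mu = A.mean(axis=0)                          # column means, added back at the end
    M = (A - mu).T
    latent, coeff = np.linalg.eigh(np.cov(M))
    coeff = coeff[:, np.argsort(latent)[::-1][:k]]   # k leading eigenvectors
    score = coeff.T @ M                          # data in the reduced space
    return (coeff @ score).T + mu                # back to the original space

rng = np.random.default_rng(1)
# synthetic stand-in for a grayscale image: rank-3 structure plus a little noise
A = rng.normal(size=(60, 3)) @ rng.normal(size=(3, 40))
A = A + 0.01 * rng.normal(size=(60, 40))

errors = [np.linalg.norm(A - reconstruct(A, k), 'fro') for k in (0, 1, 3, 40)]
print(errors)   # decreasing; close to zero when all 40 components are used
```

The scores alone live in the reduced space. Multiplying them by the eigenvectors and adding the mean back is what maps them to pixel values again, which is why both steps appear in the reconstruction line.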

At the end of this experiment, we can plot the distance of the reconstructed images from the original image in Frobenius norm (red curve) and the cumulative sum of the eigenvalues (blue curve). Recall that the cumulative sum of the eigenvalues shows the level of variance accounted for by the corresponding eigenvectors. The x axis shows the number of eigenvalues/eigenvectors used.

from pylab import plot,axis
figure()
perc = cumsum(latent)/sum(latent)
dist = dist/max(dist)
plot(range(len(perc)),perc,'b',range(0,full_pc+10,10),dist,'r')
axis([0,full_pc,0,1.1])
show()
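
The two curves are closely related: with the covariance normalized by n-1 (the default of numpy's cov), the squared Frobenius distance of a reconstruction with k components equals (n-1) times the sum of the discarded eigenvalues. The sketch below checks this numerically on synthetic data; the data and the variable names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
# synthetic data: 50 observations, 8 variables with different spreads
A = rng.normal(size=(50, 8)) * np.arange(1, 9)
n = A.shape[0]
mu = A.mean(axis=0)
M = (A - mu).T
latent, coeff = np.linalg.eigh(np.cov(M))
order = np.argsort(latent)[::-1]
latent, coeff = latent[order], coeff[:, order]

explained = np.cumsum(latent) / latent.sum()     # what the blue curve shows

err2, predicted = [], []
for k in range(len(latent) + 1):
    C = coeff[:, :k]                             # first k eigenvectors
    Ar = (C @ (C.T @ M)).T + mu                  # reconstruction with k PCs
    err2.append(np.linalg.norm(A - Ar, 'fro') ** 2)
    predicted.append((n - 1) * latent[k:].sum()) # discarded eigenvalues

print(np.allclose(err2, predicted))              # True
```

This is why the red curve falls as the blue curve rises: every component that is added moves its eigenvalue from the error to the explained variance.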

24 comments:

Anonymous, March 31, 2013 at 4:16 PM
I know this is an old post (and I appreciated it as a reference), but I was amused at the following:
In your code you
from numpy import mean,cov,cumsum,dot,linalg,size,flipud
from pylab import *

The from pylab import * overwrites *everything* you imported from numpy in the global namespace. =)

Amenhotep, October 20, 2013 at 5:38 PM
JustGlowing, you say in your program:

if numpc < p or numpc >= 0: # then cut some PCs

Should it be "and" instead of "or" there?

Sinooshka, March 14, 2014 at 12:55 AM
Could you please explain about this line:

Ar = dot(coeff,score).T+mean(A,axis=0) # image reconstruction

I am confused about where you add the previously subtracted mean of the original matrix to the dot product. The original data is already projected onto a new vector space; are we allowed to add the mean that was subtracted from the original data back to the new scores? I also do not understand why we should multiply the scores by the eigenvectors again. By this time we have the sorted scores, and taking the first, second, etc. scores will give us the data projected onto PC1, PC2, ... so why didn't you just take the first 'n' scores in order to show the data in the reduced space?

Sinooshka, March 16, 2014 at 9:16 PM
Thank you for the clarification, JustGlowing!
I also have another question:
What if we want to apply PCA to the original image with all 3 bands? In that case, how are we going to define our latent variables and, consequently, the coefficients?

Sinooshka, March 16, 2014 at 10:17 PM
OK, now let's suppose that I have a cube (a hyperspectral image) with not 3 bands but, let's say, 200 bands, and I want to reduce the number of bands to, let's say, 5. What I need to do is to roll out each band (2D image) into a 1D vector and create an (n x 200) matrix from them. Then I want to apply PCA or any other dimension reduction method to this data. The question is: is that a correct way of using PCA on this data?

Sinooshka, March 31, 2014 at 2:27 PM
Just in case for others having the same question:

In order to apply conventional PCA to a hypercube, it is
necessary to ‘unfold’ the hypercube into a two-dimensional
matrix in which each row represents the spectrum of 1 pixel.

http://onlinelibrary.wiley.com/doi/10.1002/cem.1127/pdf

Anonymous, May 22, 2014 at 1:57 PM
why do you use

M = (A-mean(A.T,axis=1)).T

for example

A= np.array([[1,2],[2,3],[7,6]])
array([[1, 2],
[2, 3],
[7, 6]])

M = (A-np.mean(A.T,axis=1)).T

array([[-2.33333333, -1.33333333, 3.66666667],
[-1.66666667, -0.66666667, 2.33333333]])

so the matrix seems to change dimensions

Anonymous, July 17, 2015 at 12:36 AM
Hi, thanks for this useful post.

Is this line from the 2nd block of code correct?

A = mean(A,2) # to get a 2-D array

Shouldn't imread load the .jpg as a 2D array already? And why would you use numpy.mean() here to "get a 2-D array" ?

Thanks for your time.

Unknown, October 30, 2015 at 1:04 PM
Hi JustGlowing,

I have just tried running this program but it won't work. It says:

File "C:\Users\Martin Dalgaard\Anaconda\lib\site-packages\numpy\core\_methods.py", line 50, in _count_reduce_items
items *= arr.shape[ax]

IndexError: tuple index out of range

Since the error seems to be in the numpy files I don't know how to solve it in my code. I hope you can help me. Thanks in advance. :-)

- Martin Dalgaard

Anonymous, April 5, 2016 at 4:42 AM
Hi! I have a question: I tried running your program but I can't see anything; I only see: Process finished with exit code 0....

Unknown, June 20, 2017 at 6:46 AM
Hi, how can I apply this code to more than 10 images? Please help me.

benjii, September 6, 2018 at 10:24 AM
Hi,

I know this is an old post, but sometimes (not always) when I try to use your function, I get the following:

---> 19 imshow(Ar)
... (bla bla)
TypeError: Image data cannot be converted to float

The reason for this is that print(Ar[0][0]) gives back :
(61.335897435897614+0j)

Which is a complex number.

At first I thought it was because I used a colored image, but after several tries, it seems that every pic that is wider than it is tall has this problem, whereas pics that follow your original pic's pattern (taller than wide, if you will) work with your code.

Any idea what's wrong here and how to fix it ?

sundar, August 4, 2019 at 9:48 PM
Excellent post! I had been searching for this example for quite a long time... Thanks.

Unknown, October 26, 2019 at 9:24 PM
Very helpful for PCA image compression, thanks.