Disclaimer: this post contains no concrete references, examples, excerpts or solutions for any of the Coursera courses, exercises, or homework assignments.
Matrix Basics
Suppose we have a 3 by 5 matrix A like this:octave:> A = [1 2 3 4 5; 2 3 4 5 6; 3 4 5 6 7]
A =
1 2 3 4 5
2 3 4 5 6
3 4 5 6 7
Then to extract a single element from A is just like in most other languages:
octave:> A(2,3)Just remember that following mathematical conventions Octave indexes start with 1.
ans = 4
Almost everything in Octave is array (vector or matrix or similar) and index is no exception. Let's take a range for example. Range is a row vector with evenly spaced elements, e.g.:
octave:> 1:5Operation A(1:3, 1:5) returns whole A again because it selects all of its rows and columns. Ranges can exist by themselves as vectors but there is special type which is available only in the context of matrix index:
ans =
1 2 3 4 5
octave:> A(1:3,1:5)
ans =
1 2 3 4 5
2 3 4 5 6
3 4 5 6 7
octave:> A(2:end,3:end)Ranges 2:end and 3:end are defined only within concrete matrix context as keyword end indicates last row or column position within matrix. You can even select elements for last 2 rows and columns like this:
ans =
4 5 6
5 6 7
octave:> A(end-1:end,end-1:end)
ans =
5 6
6 7
Logical Operations on Matrices
Logical arrays in Octave contain all logical elements and are usually results of relational operators with vectors and matrices like this:octave:> A != 3One cool application of this is inverting identity matrix:
ans =
1 1 0 1 1
1 0 1 1 1
0 1 1 1 1
octave:> I = eye(3)
I =
Diagonal Matrix
1 0 0
0 1 0
0 0 1
octave:> I == 0
ans =
0 1 1
1 0 1
1 1 0
Euclidean Distance
Given two vectors (same size) find Euclidean distance between them.a = [0 0 0];If you find yourself writing in Octave more complex solutions for similar problems with vectors then stop and review your vectorization approach.
b = [1 2 2];
distance = sqrt(sumsq(a-b));
Vectorizing indexes
Suppose you have a collection of m vectors in n-dimensional space stored as m x n matrix X. Suppose that the value of last (n-th) coordinate of these vectors is always 0 or 1. Then we want to produce 2 subsets of X - one subset of vectors with last coordinate equal to 0 and the other subset where vectors have last coordinate equal to 1:n = size(X, 2);
X0 = X( find( X(:, n) == 0 ), :);
X1 = X( find( X(:, n) == 1 ), :);
Randomness
Random numbers appear in many problems. Basic approach is a matrix filled with random numbers from uniform distribution on the interval (0, 1):octave:> rand(3,4)But Octave offers a few shortcuts. For one, such common distributions as normal, exponential, Poisson, and gamma each receive their own function randn, rande, randp, and randg.
ans =
0.347937 0.317482 0.630678 0.245148
0.917634 0.649125 0.634592 0.837635
0.994745 0.092818 0.154936 0.966380
Function randperm produces a row vector of randomly permuted integers from 1 to n:
octave:> randperm(5)If you are looking for an arbitrary vector with values between 0 and N of size n (n<=N) then randperm gets id done (in this case N = 100 and n = 10):
ans =
3 1 4 5 2
octave:> x = randperm(100)(1:10)
x =
88 1 89 25 76 19 78 38 99 34
This combined with vector indexing accomplishes rather elaborate task in short one-liner: suppose we have m n-dimensional vectors stored as m x n matrix X and we need to pick k vectors (k < m) randomly. The following gets this done:
X(randperm(size(X, 1))(1:k), :);
Newer releases of Octave (I use 3.2.4) added functions randi and randperm(n, m) that offer even nice features.
Binary Singleton Expansion Function (bsxfun)
This function reminds me of Python map function, but having it in Octave is necessity (unlike in Python). When vectorizing in Octave we have a few options: 1) both parameters are of the same dimensions for element-wise application; 2) parameters are of compatible sizes for matrix operations like multiplication; 3) one parameter is a matrix and the other is a scalar.And then we use bsxfun for vectorizing everything else, for example applying a vector to each row:
X = rand(5, 3);
mu = mean(X);
sigma = std(X);
X_norm = bsxfun(@minus, X, mu);
X_norm = bsxfun(@divide, X_norm, sigma);
The operations above resulted in normalizing all 5 vectors (rows) from X: X_norm contains vectors with 0 means and standard deviations 1 for all 3 dimensions. In just 2 lines bsxfun applied mu (means) and sigma (standard deviations) to each row of X and X_norm.
Timing operations
Use functions tic and toc to measure execution time in Octave to tune performance when necessary:octave:> tic; A * A'; toc
Elapsed time is 2.58e-008 seconds.