What is a determinant?

The determinant is a scalar value defined for square matrices that naturally arises when solving systems of linear equations by elimination. For a 2×2 matrix, det(A) = a₁₁a₂₂ − a₁₂a₂₁. When this value is nonzero, the system has a unique solution. Historically, it was independently discovered by Seki Takakazu (c. 1683) and Leibniz (1693).

What is Cramer's rule?

Cramer's rule expresses the solution of a system of linear equations Ax = b using determinants. The j-th component is given by xⱼ = det(Aⱼ) / det(A), where Aⱼ is the matrix obtained by replacing the j-th column of A with the vector b. It requires det(A) ≠ 0, and is computationally expensive, so Gaussian elimination is preferred in practice.

What happens when the determinant is zero?

When det(A) = 0, the system Ax = b does not have a unique solution. Either no solution exists (inconsistent system) or infinitely many solutions exist (underdetermined system). In 2D, this corresponds to two lines being parallel or coincident.

Deriving the Determinant: Systems of Equations and Cramer's Rule

Goals

Show that the determinant naturally arises when solving the system of linear equations $A\boldsymbol{x} = \boldsymbol{b}$. This is historically the oldest approach, and the practical motivation is clear from the start.

Prerequisites

Chapter 1: Vector Space Basics — Basic vector and matrix operations
Chapter 2: Linear Independence — Definition and criteria for linear independence

§1. Starting with a 2×2 System

1.1 Problem Setup

A system of two equations in two unknowns $x, y$:

\begin{cases} a_{11} x + a_{12} y = b_1 \\ a_{21} x + a_{22} y = b_2 \end{cases}

1.2 Solving for $x$: Eliminating $y$

Multiply the first equation by $a_{22}$ and the second by $a_{12}$, then subtract:

\begin{align} a_{22}(a_{11} x + a_{12} y) &= a_{22} b_1 \\ a_{12}(a_{21} x + a_{22} y) &= a_{12} b_2 \end{align}

Subtracting eliminates $y$:

$$(a_{11} a_{22} - a_{12} a_{21}) x = a_{22} b_1 - a_{12} b_2$$

1.3 The Determinant Appears

If $a_{11} a_{22} - a_{12} a_{21} \neq 0$, then:

$$x = \dfrac{a_{22} b_1 - a_{12} b_2}{a_{11} a_{22} - a_{12} a_{21}}$$

The expression in the denominator:

$$\det(A) = a_{11} a_{22} - a_{12} a_{21} = \det\begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}$$

This is the determinant. It arises naturally from the process of solving the system!

Figure 1: The determinant arises naturally from the elimination process

1.4 Solving for $y$: Eliminating $x$

Similarly, multiplying the first equation by $a_{21}$ and the second by $a_{11}$, then subtracting, eliminates $x$:

$$(a_{11} a_{22} - a_{12} a_{21}) y = a_{11} b_2 - a_{21} b_1$$

The coefficient of $y$ is the same $a_{11} a_{22} - a_{12} a_{21} = \det(A)$ as before. Dividing both sides by $\det(A) \neq 0$:

$$y = \dfrac{a_{11} b_2 - a_{21} b_1}{a_{11} a_{22} - a_{12} a_{21}}$$

1.5 The Numerators Are Also Determinants

The numerator for $x$, $a_{22} b_1 - a_{12} b_2$, can be written as a determinant:

$$a_{22} b_1 - a_{12} b_2 = \det\begin{pmatrix} b_1 & a_{12} \\ b_2 & a_{22} \end{pmatrix}$$

This is the determinant of the matrix obtained by replacing the first column of $A$ with $\boldsymbol{b}$.

Similarly, the numerator for $y$, $a_{11} b_2 - a_{21} b_1$, is also a determinant:

$$a_{11} b_2 - a_{21} b_1 = \det\begin{pmatrix} a_{11} & b_1 \\ a_{21} & b_2 \end{pmatrix}$$

This is the determinant of the matrix obtained by replacing the second column of $A$ with $\boldsymbol{b}$.

In summary:

$$x = \dfrac{\det\begin{pmatrix} b_1 & a_{12} \\ b_2 & a_{22} \end{pmatrix}}{\det(A)}, \quad y = \dfrac{\det\begin{pmatrix} a_{11} & b_1 \\ a_{21} & b_2 \end{pmatrix}}{\det(A)}$$

In both cases, we see a unified pattern: replace the column corresponding to the desired variable with $\boldsymbol{b}$. This is the essence of Cramer's rule.

§2. Cramer's Rule

2.1 The 2×2 Case

For $A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}$ and $\boldsymbol{b} = \begin{pmatrix} b_1 \\ b_2 \end{pmatrix}$:

Cramer's Rule ($n = 2$)

$$x = \dfrac{\det\begin{pmatrix} b_1 & a_{12} \\ b_2 & a_{22} \end{pmatrix}}{\det(A)}, \quad y = \dfrac{\det\begin{pmatrix} a_{11} & b_1 \\ a_{21} & b_2 \end{pmatrix}}{\det(A)}$$

Figure 2: Cramer's rule — replace the j-th column of A with b to find xⱼ

2.2 Generalization to $n$ Dimensions

For an $n$-variable system $A\boldsymbol{x} = \boldsymbol{b}$, let $A_j$ denote the matrix obtained by replacing the $j$-th column of $A$ with $\boldsymbol{b}$:

Cramer's Rule (General)

$$x_j = \dfrac{\det(A_j)}{\det(A)} \quad (j = 1, 2, \ldots, n)$$

provided $\det(A) \neq 0$.

For $n = 2$, we proved this directly by elimination in Section 1. A rigorous proof for general $n$ requires cofactor expansion and the theory of adjugate matrices, which we omit here. Instead, let us verify the formula on a concrete 3×3 example.

Note: Computing 3×3 Determinants (Sarrus' Rule)

A 3×3 determinant is computed as the sum of products along downward diagonals minus the sum of products along upward diagonals:

$$\det\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} = aei + bfg + cdh - ceg - afh - bdi$$

This method works only for 3×3 matrices. For 4×4 and larger, use cofactor expansion (Chapter 4).

2.3 Worked Example: A 3×3 System

Consider the following system:

\begin{cases} x + y + z = 6 \\ 2x + 3y + z = 11 \\ x + 2y + 3z = 14 \end{cases}

The coefficient matrix and right-hand side vector:

$$A = \begin{pmatrix} 1 & 1 & 1 \\ 2 & 3 & 1 \\ 1 & 2 & 3 \end{pmatrix}, \quad \boldsymbol{b} = \begin{pmatrix} 6 \\ 11 \\ 14 \end{pmatrix}$$

First, compute $\det(A)$ by expanding along the first row:

$$\det(A) = 1 \cdot (9-2) - 1 \cdot (6-1) + 1 \cdot (4-3) = 7 - 5 + 1 = 3$$

$x$ — Replace the first column of $A$ with $\boldsymbol{b}$:

$$\det(A_1) = \det\begin{pmatrix} \color{red}{6} & 1 & 1 \\ \color{red}{11} & 3 & 1 \\ \color{red}{14} & 2 & 3 \end{pmatrix} = 6(9-2) - 1(33-14) + 1(22-42) = 3, \quad x = \dfrac{3}{3} = 1$$

$y$ — Replace the second column of $A$ with $\boldsymbol{b}$:

$$\det(A_2) = \det\begin{pmatrix} 1 & \color{red}{6} & 1 \\ 2 & \color{red}{11} & 1 \\ 1 & \color{red}{14} & 3 \end{pmatrix} = 1(33-14) - 6(6-1) + 1(28-11) = 6, \quad y = \dfrac{6}{3} = 2$$

$z$ — Replace the third column of $A$ with $\boldsymbol{b}$:

$$\det(A_3) = \det\begin{pmatrix} 1 & 1 & \color{red}{6} \\ 2 & 3 & \color{red}{11} \\ 1 & 2 & \color{red}{14} \end{pmatrix} = 1(42-22) - 1(28-11) + 6(4-3) = 9, \quad z = \dfrac{9}{3} = 3$$

Verification: Substituting $(x, y, z) = (1, 2, 3)$ into the original equations gives $1+2+3=6$, $2+6+3=11$, $1+4+9=14$, which all check out.

§3. The Case $\det(A) = 0$

3.1 Relation to Uniqueness of Solutions

When $\det(A) \neq 0$, the system $A\boldsymbol{x} = \boldsymbol{b}$ has a unique solution.

When $\det(A) = 0$:

No solution exists (inconsistent system), or
Infinitely many solutions exist (underdetermined system)

3.2 Geometric Interpretation

In 2D, $\det(A) = 0$ means the two lines are:

Parallel (no intersection), or
Coincident (infinitely many intersections)

Figure 3: The relationship between det(A) and the solutions of a 2D system

In 3D, $\det(A) = 0$ means one of the following:

the three planes do not intersect at a single point (no solution);
the three planes intersect along a common line (infinitely many solutions);
the three planes coincide (infinitely many solutions);
two or more planes are parallel and do not intersect (no solution).

§4. Historical Background

4.1 Origins of the Determinant

The determinant was discovered through the process of solving systems of equations:

Seki Takakazu (c. 1683): Discovered in Japan while studying elimination methods for systems of equations
Leibniz (1693): Independently discovered in Europe
Cramer (1750): Formulated the general rule

4.2 Etymology

The name "determinant" comes from the fact that it determines whether a system of equations has a unique solution.

§5. Pros and Cons of This Approach

5.1 Advantages

Clear motivation: Starts from the natural problem "solve a system of equations"
Immediate usefulness: Directly connected to a solution formula
Historically authentic: Follows the actual path of discovery

5.2 Disadvantages

The formula seems to appear "out of thin air": Hard to see why it takes this particular form
Geometric meaning is deferred: The connection to volume requires separate explanation
Computationally expensive: Not practical for solving systems (Gaussian elimination is more efficient)

§6. Relation to Inverse Matrices

6.1 Inverse Matrix Formula

Expressing Cramer's rule in matrix language yields the inverse matrix formula:

$$A^{-1} = \dfrac{1}{\det(A)} \mathrm{adj}(A)$$

where $\mathrm{adj}(A)$ is the adjugate matrix (the transpose of the cofactor matrix).

6.2 $\det(A) \neq 0 \Leftrightarrow A$ Is Invertible

A matrix $A$ has an inverse (is invertible/nonsingular) if and only if $\det(A) \neq 0$.

This is another reason for the name "determinant" — it determines whether the matrix is invertible.

§7. Summary

Key Takeaways

The determinant was born from systems of equations: It appears naturally during the solution process
Cramer's rule: $x_j = \det(A_j) / \det(A)$
$\det(A) \neq 0$ ⇔ unique solution exists ⇔ $A$ is invertible
Historically the oldest approach: Seki Takakazu, Leibniz, Cramer

Related Pages:

Cofactor Expansion and Inverse Matrices: Expansion formulas and deriving the inverse
Determinants and Shear Transformations: Geometric meaning of the determinant
Computing Determinants via Row Reduction: An algorithmic perspective
History of Determinant Computation: 340 years from Cramer's rule to Strassen's fast algorithm