Okay, time to gather all this up in an answer.
What does Griffiths show in his book? Boiled down, he shows that there exists an operator $\hat a$ such that:
The combination of these two properties imply that the Hamiltonian has a set of ladder eigenstates. These eigenstates are defined as follows:
We define $|0\rangle$ to be a state satisfying $\hat{a}|0\rangle=0$.
We define $|n\rangle\equiv\frac{1}{\sqrt{n!}}(\hat{a}^\dagger)^n|0\rangle$
Using the two properties of $\hat a$ above, we can prove that each $|n\rangle$ is an eigenstate $\hat H$ with energy $nA+B$.
We know there exists at least ONE such $\hat{a}$, because Griffiths explicitly writes it down in his book and shows it obeys both of the required properties. The question is, can there exist TWO operators with this property?
Let's say $\bar a$ is an operator, and that $\bar{a}$ obeys both properties. I.e., we have:
We follow the identical procedure as above to develop a new set of ladder states $|\bar{n}\rangle$ with energies $\bar{A}\bar{n}+\bar{B}$. There are three possible cases here: either $A\neq\bar A$, or $B\neq \bar B$, or $A=\bar A$ and $B=\bar B$ but somehow still $a\neq \bar a$. We need to show each of these possibilities are impossible.
The proofs will be proofs by contradiction. In each case, we'll select a state, and show that by acting on the state with some operator, we can construct another state with lower energy. We'll show that this process doesn't terminate (doesn't result in the zero vector) no matter how many times we do it. Thus, if we continue long enough, we get negative-energy states. Since we know the simple harmonic oscillator does not have negative energy states, we'll reach a contradiction.
Let's start with $A\neq \bar A$. Without loss of generality, assume $A>\bar{A}$. Then for some $\bar{n}$, we'll have that $|\bar{n}\rangle$ has an energy that cannot be written as $An+B$. Then, by acting with $\hat{a}$, we can generate a whole ladder of states with lower energies. The state $|\phi_m\rangle\equiv(\hat a)^m|\bar n\rangle$ has energy $E_m=\bar A\bar n+\bar B-Am$. We know that $|\phi_m\rangle$ will never equal $|0\rangle$, because $|\phi_m\rangle$ never has the same energy as $|0\rangle$. But since $|0\rangle$ is the unique vector that satisfies $\hat a |0\rangle=0$, that means this process can never terminate; every $m$ gives a nonzero vector in Hilbert space. By making $m$ large, we can make $|\phi_m\rangle$ have negative energy. But the SHO is strictly positive, so this is impossible. We conclude that we cannot have $A\neq \bar A$.
Now, let's assume $A=\bar{A}$, but $B\neq \bar{B}$. WLOG, assume $B<\bar{B}$. Then $|0\rangle$ has energy $B$, while $|\bar{0}\rangle$ has energy $\bar{B}$. In particular, since $|\bar{0}\rangle$ is the unique state which satisfies $\bar{a}|\bar 0\rangle=0$, acting on $|0\rangle$ with $\bar{a}$ produces states of arbitrarily negative energy. The state $|\phi_m\rangle\equiv(\bar a)^m|0\rangle$ has energy $B-Am$, which can be arbitrarily negative if we pick large $m$. We thus conclude that we can't have $B\neq \bar B$.
Finally, say $A=\bar{A}$ and $B=\bar{B}$. We want to show that $a$ and $\bar{a}$ are essentially the same.
We know the matrix elements of $\hat{a}$ are given by
$$
\langle m|\hat{a}|n\rangle = \sqrt{n}\delta_{n-1,m}
$$
Because $\bar{a}$ generates the same ladder of energies, and the spectrum of $H$ is non-degenerate, $\bar{a}$ connects the same states as $a$, up to phases:
$$
\langle m|\bar{a}|n\rangle = e^{i\theta_n}\sqrt{n}\delta_{n-1,m}
$$
where $\theta_n$ possibly depends on the state $n$.
That's as good as you can do: you ARE allowed to pick some random ladder operator that adds phases to your states as you go up and down the ladder. But that's the only freedom you have. Phases aren't really important to the story, so you should consider the ladder operators essentially unique. In particular, you CAN'T have a ladder operator with a different lowest rung (different $B$), and you CAN'T have a ladder operator with a different spacing (different $A$).
Best Answer
The motivation for this becomes quite clear in the Heisenberg picture using which, Dirac first introduced the ladder operators (p. 136, The Principles of Quantum Mechanics) and derived their properties in the context of the Quantum Harmonic Oscillator (QHO). Of course, using this as a motivation in a classroom would require your students to be familiar with the Heisenberg picture. I will stick with your notation and conventions $(m = \hbar = \omega = 1)$ in this answer.
Given the Hamiltonian in the Schrödinger picture for the QHO, $$ \hat{H} = \frac{1}{2} \left( \hat{X}^2 + \hat{P}^2 \right), $$ The Heisenberg equations of motion are the coupled differential equations, $$ \partial_t \hat{X}_t = [\hat{X}_t, \hat{H}_t] = \hat{P}_t \quad \text{and} \quad \partial_t \hat{P}_t = [\hat{P}_t, \hat{H}_t] = -\hat{X}_t, $$ where, the subscript indicates that the operators are time dependent in the Heisenberg picture. This is relatively straightforward to decouple. However, the key is to decouple it in a way that is reminiscent of the classical oscillator. To this end, we introduce an operator $\hat{a}^\dagger = (\hat{X} + i \hat{P})/\sqrt{2}$. This leads to the equation, $$ \partial_t \hat{a}^\dagger_t = i\hat{a}^\dagger_t \implies \hat{a}^\dagger_t = \hat{a}^\dagger_0 e^{it}.$$ One can then interpret the $\hat{X}_t$ and $\hat{P}_t$ as the real an imaginary parts of the complex amplitude $\hat{a}^\dagger_t$ and this is in perfect analogy with the classical simple harmonic oscillator. Now we can rewrite our Hamiltonian using these operators in the Schrödinger picture and naturally motivate the factorization.
Personally, I think this is a wonderful way to have students gain some insight into what these ladder operators represent before delving into why they are called ladder operators when we go into the properties. It would be particularly helpful for any student who wishes to take up QFT to be introduced to the objects in this way.