$\newcommand{\ran}{\operatorname{ran}}$Here’s one way to think about it. Suppose that $y\in\ran f$; then we can pull $y$ back to $f^{-1}(y)\in X$. If $f^{-1}(y)\in\ran g$, we can pull it back to $g^{-1}(f^{-1}(y))\in Y$. If we continue this pulling back, one of two things must happen: either we reach a dead end at a point of $X$ or $Y$ that can’t be pulled back (because it’s in $Y\setminus\ran f$ or $X\setminus\ran g$), or we don’t.
Let $X_0=X\setminus\ran g$, the set of points of $X$ that cannot be pulled back at all, and let $Y_0=Y\setminus\ran f$. More generally, for each $n\in\omega$ let $X_n$ be the set of points of $X$ that can be pulled back exactly $n$ times, and let $Y_n$ be the set of points of $Y$ that can be pulled back exactly $n$ times. Finally, let $X_\omega$ and $Y_\omega$ be the subsets of $X$ and $Y$, respectively, whose points can be pulled back infinitely many times.
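The sets $X_n$ and $Y_n$ can be made concrete with a small Python sketch. The choice $X=Y=\mathbb{N}$ with $f(x)=2x$ and $g(y)=2y$ is an assumption made purely for illustration, as are the helper names `f_inv`, `g_inv` and `pullback_depth`; `None` stands in for $\omega$. In this particular example a chain of pull-backs either dead-ends or revisits a point, so remembering the visited points is enough to detect the infinite case. That shortcut would not work for arbitrary injections.

```python
from typing import Optional

# Illustrative assumption: X = Y = natural numbers, f(x) = 2x, g(y) = 2y.

def f_inv(y: int) -> Optional[int]:
    """Pull y in Y back along f; None if y is not in ran f."""
    return y // 2 if y % 2 == 0 else None

def g_inv(x: int) -> Optional[int]:
    """Pull x in X back along g; None if x is not in ran g."""
    return x // 2 if x % 2 == 0 else None

def pullback_depth(point: int, side: str) -> Optional[int]:
    """Number of times `point` (in X or Y, per `side`) can be pulled back.

    Returns None when the chain revisits a point, i.e. never dead-ends.
    """
    seen = set()
    steps = 0
    while (side, point) not in seen:
        seen.add((side, point))
        # A point of X is pulled back through g, a point of Y through f.
        prev = g_inv(point) if side == "X" else f_inv(point)
        if prev is None:
            return steps  # dead end: point lies in X_steps or Y_steps
        point, side = prev, ("Y" if side == "X" else "X")
        steps += 1
    return None  # point lies in X_omega or Y_omega

for x in (1, 2, 12, 0):
    print(x, pullback_depth(x, "X"))
```

With these maps the odd numbers make up $X_0$, the point $12$ lies in $X_2$, and $0$ is the only point of $X_\omega$.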
At this point a sketch helps; it should show the partitions $\{X_n:n\le\omega\}$ of $X$ and $\{Y_n:n\le\omega\}$ of $Y$, and it should include arrows indicating what parts of $X$ get mapped to what parts of $Y$ and vice versa. To avoid having arrows crossing, I’ve taken $X$ and $Y$ apart in the following diagram.
$$\begin{array}{ccccccccccc}
X_0&\overset{f}\longrightarrow&Y_1&\overset{g}\longrightarrow&X_2&\overset{f}\longrightarrow& Y_3&\overset{g}\longrightarrow&X_4&\dots&X_\omega\\
Y_0&\overset{g}\longrightarrow&X_1&\overset{f}\longrightarrow&Y_2&\overset{g}\longrightarrow&X_3&\overset{f}\longrightarrow&Y_4&\dots&Y_\omega
\end{array}$$
Each of the arrows is a bijection, so I can break up the diagram into $\omega$ self-contained parts. The first two parts are:
$$\begin{array}{ccc}
X_0&\overset{f}\longrightarrow&Y_1\\
Y_0&\overset{g}\longrightarrow&X_1
\end{array}\qquad
\begin{array}{ccc}
X_2&\overset{f}\longrightarrow&Y_3\\
Y_2&\overset{g}\longrightarrow&X_3
\end{array}$$
Ignoring $X_\omega$ and $Y_\omega$ for the moment, I can rearrange the rest of the diagram to give me a bijection from $X\setminus X_\omega$ to $Y\setminus Y_\omega$:
$$\begin{array}{ccc}
X_0&\overset{f}\longrightarrow&Y_1\\
X_1&\overset{g^{-1}}\longrightarrow&Y_0\\
X_2&\overset{f}\longrightarrow&Y_3\\
X_3&\overset{g^{-1}}\longrightarrow&Y_2\\
\vdots&\vdots&\vdots\\
X_{2k}&\overset{f}\longrightarrow&Y_{2k+1}\\
X_{2k+1}&\overset{g^{-1}}\longrightarrow&Y_{2k}\\
\vdots&\vdots&\vdots
\end{array}$$
Finally, I claim that $f[X_\omega]=Y_\omega$: everything in $X_\omega$ can be pulled back infinitely often, so everything in $f[X_\omega]$ can be pulled back infinitely often, and therefore $f[X_\omega]\subseteq Y_\omega$. On the other hand, if $y\in Y_\omega$, then $y$ can be pulled back infinitely often, so it must be possible to pull $f^{-1}(y)$ back infinitely often, and therefore $f^{-1}(y)\in X_\omega$. Thus, $Y_\omega\subseteq f[X_\omega]$ as well. The diagram above can now be completed to show a bijection from $X$ onto $Y$:
$$\begin{array}{ccc}
X_0&\overset{f}\longrightarrow&Y_1\\
X_1&\overset{g^{-1}}\longrightarrow&Y_0\\
X_2&\overset{f}\longrightarrow&Y_3\\
X_3&\overset{g^{-1}}\longrightarrow&Y_2\\
\vdots&\vdots&\vdots\\
X_{2k}&\overset{f}\longrightarrow&Y_{2k+1}\\
X_{2k+1}&\overset{g^{-1}}\longrightarrow&Y_{2k}\\
\vdots&\vdots&\vdots\\
X_\omega&\overset{f}\longrightarrow&Y_\omega
\end{array}$$
The bijection is defined piecewise, but that’s no problem.
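To see the piecewise definition in action, here is a minimal Python sketch under the illustrative assumption $X=Y=\mathbb{N}$, $f(x)=2x$, $g(y)=2y$ (my choice, not part of the argument above). For these maps every pull-back halves the point, so the depth of $x\ne 0$ is just the number of times $2$ divides $x$, and $0$ is the only point of $X_\omega$. The names `depth_in_X` and `h` are hypothetical helpers.

```python
from typing import Optional

def f(x: int) -> int:
    """Illustrative injection f : X -> Y."""
    return 2 * x

def depth_in_X(x: int) -> Optional[int]:
    """Pull-back depth of x in X for this example; None plays the role of omega."""
    if x == 0:
        return None  # 0 = g(0) = f(0) can be pulled back forever
    n = 0
    while x % 2 == 0:  # each pull-back (through g or f) halves the point
        x //= 2
        n += 1
    return n

def h(x: int) -> int:
    """The piecewise map: f on X_{2k} and X_omega, g^{-1} on X_{2k+1}."""
    d = depth_in_X(x)
    if d is None or d % 2 == 0:
        return f(x)   # rows X_{2k} -> Y_{2k+1} and X_omega -> Y_omega
    return x // 2     # g^{-1}(x): rows X_{2k+1} -> Y_{2k}

values = [h(x) for x in range(200)]
assert len(set(values)) == len(values)  # no collisions on the sample
assert set(range(100)) <= set(values)   # every y < 100 is hit

print([h(x) for x in range(9)])
```

A finite sample cannot prove that `h` is a bijection, of course; the two assertions only check that it behaves like one on an initial segment.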
There are a few details to be filled in to make this a fully rigorous proof, but I think that it does give a reasonable idea of one possible intuition.
Added: Here’s a very rough sketch. Arrows from left to right are (parts of) $f$, and arrows from right to left are (parts of) $g$.
Best Answer
Hope you've solved it by now, but here is a complete proof for others who may find themselves looking for answers.
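Showing the first inclusion, $A \subset f^{-1}(f(A))$, is straightforward.
def. $f(A) = \{f(x) : x \in A \}$
def. $f^{-1}(B) = \{x : f(x) \in B \}$
If $x \in A$, then $f(x) \in f(A)$ by the first definition, so $x \in f^{-1}(f(A))$ by the second.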
Now all that needs to be shown is that if $f$ is injective, then $A \supset f^{-1}(f(A))$.
def. $f:X \rightarrow Y$ is injective $\Leftrightarrow (f(x)=f(y) \implies x = y)$
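Proof by contradiction. Assume the opposite: there is some $x \in f^{-1}(f(A))$ with $x \notin A$. Then $f(x) \in f(A)$, so $f(x) = f(a)$ for some $a \in A$. Since $f$ is injective, $x = a \in A$, a contradiction.
Thus we have $A \supset f^{-1}(f(A))$ and thereby $A = f^{-1}(f(A))$.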
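A quick finite sanity check in Python shows why injectivity matters. The domain, the set $A$ and the two maps below are examples I picked for illustration; `image` and `preimage` simply transcribe the two definitions.

```python
def image(f, A):
    """f(A) = {f(x) : x in A}."""
    return {f(x) for x in A}

def preimage(f, X, B):
    """f^{-1}(B) = {x in X : f(x) in B}, with X the (finite) domain."""
    return {x for x in X if f(x) in B}

def injective_map(x):
    return 2 * x + 1   # injective on the integers

def square(x):
    return x * x       # not injective: square(-1) == square(1)

X = set(range(-3, 4))  # domain {-3, ..., 3}
A = {1, 2}

# Injective case: pulling the image back recovers A exactly.
assert preimage(injective_map, X, image(injective_map, A)) == A

# Non-injective case: only A ⊂ f^{-1}(f(A)) survives; the preimage is larger.
assert preimage(square, X, image(square, A)) == {-2, -1, 1, 2}
```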