[Math] How to get good at reading research papers with lots of proofs

self-learning

tl;dr

To get good at math proofs (and thinking math), do you have to first memorize all the different proof tricks, or is there a way to learn as you go?

I am a software developer and have recently decided I want to start learning math. I have been running into a road block for the past year or so because I can't understand all of the equations in research papers describing their different algorithms (Conditional Random Fields, Markov Logic Networks, Probability Theory, Graphical Models, …, for machine learning, natural language processing and all that).

All of it is basically standard equations such as those for Bayes Theorem, Markov Chains, CRFs, etc., or something using propositional logic like the Markov Logic Networks, or novel theorems derived from one of these.

Looking over lots of other questions on this site, it's clear that I just need to get good at proofs, and understanding how to think mathematically / logically. I bought the book How to Prove It, and will get the other one How to Solve It soon.

My issue is, I've been studying this stuff every day for the past few weeks, and I don't feel like I am any better than when I started. In contrast, when I started learning to code, I was able to quickly get the hang of it and learn rapidly, mostly by studying other peoples code and trying to build my own programs.

I would like to do the same thing for learning math. Instead of just trying to do each exercise in a textbook which is super boring and dull, I would like to read a research paper, and slowly figure out what they are doing, learning what I need to as I run into things I can't figure out. (This is how I learned Markov Chains with no probability or set theory or anything, just had to go back and learn the basics, but at least it was applied which keeps me motivated and excited about learning the basics). I am also watching a bunch of the math lectures on coursera related to research papers I'm interested in, which is helpful.

The problem is, I feel like this isn't working with most math proofs, because there seems to be so many little tricks with all of the proofs that have to be memorized in order to be even the slightest bit effective at reading and understanding new theorems.

My question is, do I just have to sit down and memorize all kinds of these tricks to get good at math proofs, and thinking math, or is there a better way?

These are things like transformations between logical statements using DeMorgan's Law, the Conditional Law, etc. Or, is there a way to slowly pick these up? Like with programming, I can build some really cool stuff without ever having to learn algorithms or low-level computer science stuff; I just need to learn that when I run into a specific problem that requires that skill. With math it seems like the opposite, I have to first memorize a seemingly infinite number of things before I can even start looking at the research papers. Is there a more effective strategy?

Updated from a comment

Put another way, I can understand something in math (like DeMorgan's Law), but I can't make it stick. The reason is (I think) because I am not applying it in any real way (like to implementing something a research paper is talking about), so I never fully get it at an intuitive level (the same way that knowing by reading is different than knowing by doing). How do I get it at an intuitive level without having to just memorize these basics like DeMorgan's Law?

Best Answer

Mathematics, as you know, is an extremely broad subject with lots of different subfields to specialize in. Each branch of math is a life's work in itself. There's no real trick to understanding research papers - it's just time and effort devoted to understanding a field of math. So if you want to be able to understand research papers, I recommend you first try to understand research papers in one specific area by studying it intensely and reading books about it.

I faced this same problem. As a high school junior very eager to start math research, I started by tackling the hardest books at my disposal - Spivak's Calculus on Manifolds, Serre's A Course in Arithmetic, etc. These books were even recommended to me by a professor! But as I read them and pretended to understand them in periodic meetings I had with my mentor, I began to grow greatly discouraged, thinking that I'd never be able to understand these papers. For instance, in A Course in Arithmetic, I couldn't even read through the first page without having to look up terms that I never learned because I had never taken a formal class in algebra, like the characteristic of a finite field. I began to believe that math wasn't what I wanted to do with my life anymore.

But my feelings changed when my mentor referred me to a professor, and I enrolled in his 300-level undergraduate math class in number theory. Though it's a bit simple, it really fortified my foundations in proof-writing, and I'm becoming more familiar with concepts that I needed to read works like Serre's - I feel like I'm definitely making more progress than when I tackled math papers cold-turkey. Unlike before, I feel like I'm fast on-track to being able to understand the math papers that so discouraged me.

So my point is: start with what you understand, namely books and papers that are simpler, and if possible, enroll in a proof-based math class. Even if you consider such material too easy, it'll help for later, and you'll pick up the techniques of proofs on the way.

Also, if you're really intent on understanding research, pick one specific field to do so, and study it intensely from the ground up.

Best Answer

Related Solutions

[Math] Introduction to Operations Research

[Math] Self-contained undergrad math resources for someone with extremely weak foundations in math

Related Question