7. References and Further Study

Use these references to deepen specific topics after working through the handbook.

Core documentation

Foundational reading

Deep learning versus trees on tabular data

Modern tree libraries

Topics worth following up

  • post-pruning strategies for single trees
  • out-of-bag error versus cross-validation
  • correlated features and importance instability
  • when permutation importance is more trustworthy than impurity importance
  • handling categorical features across boosted-tree libraries

Suggested next step after this course

Take one real tabular workflow from your own work and compare:

  • a regularized single tree
  • a random forest or ExtraTrees model
  • one boosting implementation

Then write down:

  • what changed in performance
  • what changed in interpretability
  • what changed in tuning burden

That comparison usually teaches more than reading another round of definitions.

Previous