14
8

I'm a computer programmer with some interest in machine learning, but not knowing where to start if I want to get in really fast. A "Dive Into Python" kind of way. Also, it would be nice to know what other knowledge I would require.

This question is marked "community wiki".

asked Jul 02 '10 at 19:52

partoa's gravatar image

partoa
1236

edited Jul 02 '10 at 20:44

Joseph%20Turian's gravatar image

Joseph Turian ♦♦
579051125146


2 Answers:

Have a look at Data Mining: Practical Machine Learning Tools and Techniques and try to add some ML code to the Weka framework.

This answer is marked "community wiki".

answered Sep 05 '10 at 07:06

Lucian%20Sasu's gravatar image

Lucian Sasu
513172634

-1

I found Tom Mitchell's Machine Learning textbook to be very readable for a programmer. He explains the algorithms with pseudo-code. He does not use integrals or matrices.

  • Its first chapter gives you a guide how to solve a new problem.
  • Its second chapter shows that you cannot generalize without using an induction bias. For example, by considering only some hypotheses (e.g., linear models) or by preferring smooth functions and simple hypotheses.

The other chapters could be then read based on your interests.

This answer is marked "community wiki".

answered Aug 06 '10 at 10:38

Ivo%20Danihelka's gravatar image

Ivo Danihelka
25051115

2

Mitchell's book is very good, but it is really outdated, and maybe even slightly misleading, if looked at from a post-2000 viewpoint. It's missing SVMs, online learning, most learning theory surrounding SVM (VC theory, etc) and online learning, regularization, sparsity, most graphical model topics (LDA, for example), structured learning (CRFs, M3Ns), boosting, learning reductions, kernels, unsupervised learning as is common in NLP land, deep learning, etc; it also focuses a lot on decision trees, rules, genetic algorithms, and logical learning (most of which have found to be not so good, except maybe for MLNs for logical learning).

Also the reinforcement learning theory as presented in the book is very outdated.

Also, integrals and matrices are very useful in machine learning, and a real understand of the field cannot be achieved without a bit of linear algebra, probability, calculus, and, increasingly, optimization.

(Aug 06 '10 at 11:04) Alexandre Passos ♦

I agree with Alexandre, Mitchell's book misses too many now-relevant goodies.

(Aug 06 '10 at 16:40) Joseph Turian ♦♦

Thanks for mentioning what is missing. I consider Mitchell's book to be helpful, because: 1) Experienced programmer correlates with rusty integrals. 2) I haven't seen the inductive bias explained so well elsewhere. Others also like the book.

(Sep 04 '10 at 05:12) Ivo Danihelka
Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.