I need to integrate a python LDA module in my tool. It seems there are number of LDA oracles out there. I am wondering if folks here have tested any of the tools and can point me to a well tested one that I can just take and integrate with my other libraries.

asked Feb 27 '11 at 13:57

Mark%20Alen's gravatar image

Mark Alen
1323234146


2 Answers:

I'd use Matt Hoffman's implementation of VB and online VB for LDA, as it works well, is well debugged, and is a lot faster than the lda code I have online (which is poorly maintained).

answered Feb 27 '11 at 15:02

Alexandre%20Passos's gravatar image

Alexandre Passos ♦
2554154278421

gensim has a fast Python implementation of LDA. It can process input larger than RAM, and it can run distributed over a cluster of computers (if you have one).

The implementation is based on cleaned-up Hoffman's onlineldavb code suggested by Alexandre Passos. Processing the English Wikipedia with gensim takes 11 hours on a laptop.

answered Mar 24 '11 at 02:50

Radim's gravatar image

Radim
38679

edited Mar 24 '11 at 09:22

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.