|
I need to integrate a Python LDA module into my tool. There seem to be a number of LDA implementations out there. Has anyone here tested any of them, and can you point me to a well-tested one that I can simply take and integrate with my other libraries? |
|
I'd use Matt Hoffman's implementation of variational Bayes (VB) and online VB for LDA. It works well, is well debugged, and is a lot faster than the LDA code I have online (which is poorly maintained). |
|
gensim has a fast Python implementation of LDA. It can process input larger than RAM, and it can run distributed over a cluster of computers (if you have one). The implementation is based on a cleaned-up version of Hoffman's onlineldavb code, the one suggested by Alexandre Passos. Processing the English Wikipedia with gensim takes 11 hours on a laptop. |
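
To make the "larger than RAM" point concrete, here is a minimal sketch of the streaming pattern: the corpus is an object that yields one bag-of-words vector at a time, so only a single document is in memory at once. The names `StreamedCorpus`, `build_vocabulary`, `train_lda`, and the toy `DOCS` list are illustrative assumptions, not part of gensim. The only gensim call used is `gensim.models.LdaModel`, and the parameter values (`num_topics=2`, `passes=5`) are arbitrary choices for the example rather than recommendations.

```python
from collections import Counter

# Toy documents standing in for a file that is too large to load at once.
DOCS = [
    "human computer interaction",
    "graph trees minors",
    "graph minors survey computer",
]


def build_vocabulary(lines):
    """Assign each distinct token an integer id, in order of first appearance."""
    token2id = {}
    for line in lines:
        for token in line.lower().split():
            token2id.setdefault(token, len(token2id))
    return token2id


class StreamedCorpus:
    """Re-iterable corpus that yields one bag-of-words vector per document."""

    def __init__(self, lines_factory, token2id):
        # lines_factory is a callable returning a fresh iterable of documents,
        # e.g. lambda: open("corpus.txt") for one document per line.
        self.lines_factory = lines_factory
        self.token2id = token2id

    def __iter__(self):
        for line in self.lines_factory():
            counts = Counter(
                self.token2id[t] for t in line.lower().split() if t in self.token2id
            )
            # List of (token_id, count) pairs, the sparse format gensim expects.
            yield sorted(counts.items())


def train_lda(corpus, token2id, num_topics=2):
    """Train an LDA model on a streamed corpus (requires gensim to be installed)."""
    from gensim.models import LdaModel

    id2word = {i: t for t, i in token2id.items()}
    return LdaModel(corpus=corpus, id2word=id2word,
                    num_topics=num_topics, passes=5, random_state=0)


token2id = build_vocabulary(DOCS)
corpus = StreamedCorpus(lambda: iter(DOCS), token2id)
```

With gensim installed, `lda = train_lda(corpus, token2id)` followed by `lda.print_topics()` would show the learned topics. In practice gensim's own `corpora.Dictionary` and its `doc2bow` method can replace the hand-rolled vocabulary code here.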