I'm looking for a way to automatically generate a parser and a translator from a corpus of source code and its translation into another programming language.
Background story:
In particular:
- I'm not looking for ready-to-use code, but if it exists, please share it.
- I'm also interested to know how the machine-learning algorithms/techniques (if any) can be applied to other problems/domains.
- My preferred way to model data is the graph; if that doesn't make sense here, don't push it too hard.
- I don't need the program to understand the underlying knowledge represented in the source and the target. It only has to learn how to go from source to target.
- This is different from NLP, as I understand it. The closest thing seems to be "controlled language machine translation". But here I think that both the source and target languages have specific properties (like a known grammar) that make the problem different from, and simpler than, natural language machine translation.
- I was told that this is impossible and is the subject of research by a lot of brilliant minds, if not the best. I did not find any articles dealing with this specific subject.
- If this problem is AI-complete, I'd like an explanation and references that explain why.
- If there is a solution, I'd like to know what it is, even if I don't have the math level required to fully understand it, and even if the solution requires solving an NP-complete problem.

Corpus:
I also asked this question at http://stackoverflow.com/questions/22621164/how-to-automatically-generate-a-parser-code-to-code-translator-from-a-corpus