|
How does machine learning genrally handle problems where there are a variable number of inputs? To put another way, the input vector to a Machine Learning problem has to be some length, what length do you make it for vairable length data? For example, if I was learning sentences how would you handle the fact the the number of characters (or words) in a sentence can be any number?
|
|
There may be several methods to do that, dependent on the kind of data you have. For instance, I worked with variable length input data once. They were neural spike times within a window. The approach there was assume infinite temporal resolution and place a gaussian window centered at each spike time. Thus, we could integrate the resulting signal or calculate its inner product with another spike train, even if they had different spike counts. We used this inner product definition to create a Reproducing Kernel Hilbert Space. Note that this way there was no need to define a maximum size or to binaries the space... |