Animal learning as data compression

Instead of the more general kind of learning, let's look at what kind of qualities animal learning has, what mechanisms actually make brains work. I think that there's undoubtedly at least part of our brains (and other animal brains) that work this way. They connect stimulus to response and are able to be adapted over time, which would qualify as a form or learning. If we accept that learning is an unconscious way of storing stimulus and response pairs, then we can think of the different ways that could be accomplished.

We can imagine a very simple animal that just has one set of sensory receptors and each individual receptor is simply hardwired to stimulate a particular response - when triggered it flees light for example. And we can imagine more and more of these simple 1-to-1 sensory-to-response connections being made, each working independently of each other. Eventually though for animals with a lot of detailed senses and a lot of possible responses, they would end up with lots of redundant neurological wiring. A better system would be to find areas where there was overlap and create a network that shared data transmission resources and used logic to make sure that the appropriate stimulus-response pairs we're still connected correctly. This would in essence be a form of data compression.

As a hypothetical example we can think of a neural network in a digital animal that connects the stimulus of 4 binary sensors with a 7-segment number display, and the purpose of this neural network is to convert the numeric value from binary to the display. A simple and straightforward way of doing it would be to have 10 neurons, each with 4 dendrites to sense the binary stimulus and 7 dendrites to active the appropriate segments of the display. If we measure the complexity of each neuron by how many boolean logic gates it would take to 'program' it, the average would be around 4 each, for a total of at least 40. This is simple, but not very efficient way of encoding these stimulus-response behavior. For comparison, a typical binary to 7-segment decoder can be made with just over 20 logic gates.

That simple hypothetical seems to have a lot in common with the kind of neurological structures we see in actual animal brains. There's a huge amount of overlap, while still efficiently connecting stimulus and response pairs. If those pairs are fixed, that's usually what we'd call instinct, but if there's a process in place so that the network can change connections to maintain most existing pairs while adding or updating new ones, that sounds like learning.

This kind of 'compression' would also lead to a de facto set of predictions for the kinds of responses that would be triggered by similar stimulus that haven't been observed before. It may be that the response to a new stimulus is close enough to previously learned ones to be useful, or it could even be that a new stimulus is essentially the combination of two previous stimuli and that the way the responses have been learned/compressed creates an adaptive prediction of the correct response for the new situation.

Of course a predicted response could just be an artifact of the 'compression' and might not be adaptive at all, in which case feedback would lead to rewiring that attempts to preserve the previous 'correct' responses while integrating a new response in to the existing 'compression' scheme. Which is essentially a description of learning given this perspective. What kinds of new responses are able to be learned would be dependent on how similar they are to existing responses and the cost/benefit of learning the new responses as opposed to relying on a 'good enough' response. Cases in which it's not evolutionarily advantageous to override a large body of compressed/learned responses to integrate a new and unusual response could be thought of as behavioral biases - consistent non-optimal responses.

Thinking of the brains of animals this way would lead to some testable predictions. We'd expect animals that have more complex environments and/or more complex learned behaviors to have to have larger brains to encode the larger set of responses. It also would make sense that the entire brain wouldn't be wired this way, there would be areas that are hardwired as instincts. We'd also expect to see predictable patterns in the way stimulus-response pairs are encoded/compressed.

Learning and Intelligence

Additional thoughts on learning and intelligence In previous posts I've looked at the idea that learning is the process of connecting responses to stimulus , and that the actual way that learning happens in animals is essentially a form of data compression. If we think of learning like this, then we should ask how the data compression works, what process or rules does it follow if any? The key to good data compression is to find patterns in the data and take advantage of that, and the world we live in comes ready made with rules that govern how it works, the physical laws of the universe. It would seem that it would be possible for our brains to take advantage of this to compress responses to stimuli efficiently, and that that process could have the beneficial side effect of making the prediction of physical processes easier. The learning process would not only be encoding individual responses, but also patterns in those responses and the rules that determine what shape those...

Definition Mining Co.

Search This Blog

Animal learning as data compression

Labels

Popular posts from this blog

Intelligence

Learning and Intelligence