Professional Documents
Culture Documents
Fundamentals of Computational Neuroscience 2 Ed
Fundamentals of Computational Neuroscience 2 Ed
net/publication/220689220
CITATIONS READS
4 6,503
1 author:
Thomas Trappenberg
Dalhousie University
187 PUBLICATIONS 2,987 CITATIONS
SEE PROFILE
All content following this page was uploaded by Thomas Trappenberg on 01 June 2014.
Thomas Trappenberg
Expert 1
Integration
Expert 2 Output
network
Input
Expert n
Gating
network
f (x ) = abs (x )
X ΣΠ abs(x )
x
The ‘what-and-where’ task
Output node #
5
5 10
4 15
3 1 5 10 15 20 25 30 35
Hidden node #
2
C. With bias towards short connections
1
Output node #
1
5
1 2 3 4 5
10
15
1 5 10 15 20 25 30 35
Hidden node #
1.5
0.08 m=2
0.06 1
m=4
0.04
0.5
0.02
0 0
0 0.2 0.4 0.6 0.8 1 2 4 6 8 10
g : Relative intermodular strength m : Number of modules
Sequence learning
Overlap in A
Pathway 2000
1000
w AB w BB −1000
0 2 4 6 8 10 12 14 16 18 20
w AA w BA 2000
1500
Overlap in B
1000
500
0
−500
0 2 4 6 8 10 12 14 16 18 20
Module A Module B Time [τ]
PFC
PMC
HCMP
Node number
Node number
80 80 80
60 60 60
40 40 40
20 20 20
0 10 20 0 10 20 0 10 20
Time Time Time
Motor learning and control
Disturbance
Afferent
Re-afferent
Forward model controller
Disturbance
Inverse
model
Disturbance
+
Desired Motor command - command
Motor Controlled Actual Sensory
state
- generator object state system
Afferent
Re-afferent
Cerebellum
Basket cell
Molecular
layer { Purkinje
Purkinje { neuron Golgi cell
{
layer
Granular Granule
layer cell
Climbing fibre
Mossy fibre
Excitatory synapse
Intracerebellar Inhibitory synapse
{
and vestibular Spinal cord
nuclei External cuneate nucleus
Inferior olive Out Reticular nuclei
Pontine nuclei
Reinforcement learning
Basal Ganglia
Caudate
nucleus
Pattern 4
Subthalamic
nucleus
rhat
Pattern 3
Globus Substantia nigra
Pattern 2
pallidus
( pars compacta
pars reticulata )
0 50 100
Superior colliculus Stimulus B Stimulus A Reward Episode
temporal difference learning
r (t) r (t)
V(t) V(t)
slow slow
V(t −1) V(t −1)
V(t) fast
γ V(t)
r (t) r (t)
Actor-critique and Q-learning
Thalamus
Striosomal module
Striatum
Matrix module
SPm SPs
reward prediction
ST ST
SNc Pallidum
PD DA
action selection
Basal ganglia
(actor) (critic)
Critic
Reinforcement Disturbance
signal
Afferent
Re-afferent
Further Readings
Robert A. Jacobs, Michael I. Jordan, and Andrew G. Barto (1991), Task decomposition through
competition in a modular connectionist architecture: the what and where tasks, in
Cognitive Science 15: 219–250.
Geoffrey Hinton (1999), Products of experts, in Proceedings of the Ninth International
Conference on Artificial Neural Networks, ICANN ’99, 1:1–6.
Yaneer Bar-Yam (1997), Dynamics of complex systems, Addison-Wesley.
Edmund T. Rolls and Simon M. Stringer (1999), A model of the interaction between mood and
memory, in Networks: Comptutation in neural systems 12: 89–109.
N. J. Nilsson (1965), Learning machines: foundations of trainable pattern-classifying
systems, McGraw-Hill.
O. G. Selfridge (1958), Pandemonium: a paradigm of learning, in the mechanization of
thought processes, in Proceedings of a Symposium Held at the National Physical
Laboratory, November 1958, 511–27, London HMSO.
Marvin Minsky (1986), The society of mind, Simon & Schuster.
Akira Miyake and Priti Shah (eds.) (1999), Models of working memory, Cambridge University
Press.
Daniel M. Wolpert, R. Chris Miall, and Mitsuo Kawato (1998), Internal models in the
cerebellum, in Trends Cognitive Science 2: 338–47.
Edmund T. Rolls and Alessandro Treves (1998), Neural networks and brain function, Oxford
University Press.
James C. Houk, Joel L. Davis, and David G. Beiser (eds.) (1995), Models of information
processing in the basal ganglia, MIT Press.
Richard S. Sutton and Andrew G. Barto (1998), Reinforcement learning: an introduction, MIT
Press.
Peter Dayan and Laurence F. Abbott (2001), Theoretical Neuroscience, MIT Press.
View publication stats