
Neural Networks and Deep Learning

Dr. Srikanth Thota



Restricted Boltzmann Machine

The Boltzmann machine considered so far was fully observed. We will now have hidden units as
well.
A classic architecture called the restricted Boltzmann machine assumes
a bipartite graph over the visible units and hidden units:
A bipartite graph (or bigraph) is a graph whose vertices can be divided
into two disjoint and independent sets U and V such that every edge
connects a vertex in U to one in V.
A complete bipartite graph or biclique is a special kind of bipartite
graph where every vertex of the first set is connected to every vertex of
the second set.
The hidden units learn more abstract features of the data.



Restricted Boltzmann Machine

An RBM has binary-valued hidden and visible units and consists of a matrix of weights W of
size m × n.
Each weight element w_{i,j} of the matrix is associated with the connection between the
visible (input) unit v_i and the hidden unit h_j.
There are bias weights (offsets) a_i for v_i and b_j for h_j.
Given the weights and biases, the energy of a configuration (pair of Boolean vectors) (v, h)
is defined as
E(v, h) = −∑_i a_i v_i − ∑_j b_j h_j − ∑_i ∑_j v_i w_{i,j} h_j

In matrix notation, E(v, h) = −a^T v − b^T h − v^T W h.


This energy function is analogous to that of a Hopfield network.
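For concreteness, here is a minimal NumPy sketch of this energy function; the function and variable names are illustrative, not from the slides.

```python
# A minimal sketch (not from the slides): evaluating the RBM energy
# E(v, h) = -a^T v - b^T h - v^T W h for binary vectors v and h.
import numpy as np

def rbm_energy(v, h, W, a, b):
    """Energy of one visible/hidden configuration.

    v: (m,) binary visible vector, h: (n,) binary hidden vector,
    W: (m, n) weight matrix, a: (m,) visible biases, b: (n,) hidden biases.
    """
    return -(a @ v) - (b @ h) - (v @ W @ h)

# Tiny example with m = 3 visible and n = 2 hidden units.
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(3, 2))
a = np.zeros(3)
b = np.zeros(2)
v = np.array([1, 0, 1])
h = np.array([0, 1])
print(rbm_energy(v, h, W, a, b))
```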



Restricted Boltzmann Machine

The joint probability distribution for the visible and hidden vectors is defined in terms of
the energy function as follows
P(v, h) = (1/Z) e^{−E(v, h)}
where Z is a partition function defined as the sum of e−E(v,h) over all possible
configurations, which can be interpreted as a normalizing constant to ensure that the
probabilities sum to 1.
The marginal probability of a visible vector is the sum of P(v, h) over all possible hidden
layer configurations, P(v) = (1/Z) ∑_{h} e^{−E(v, h)}, and vice versa.
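A minimal sketch of Z and P(v) by brute-force enumeration follows; since Z sums over all 2^(m+n) configurations, this is feasible only for a toy RBM, and the helper names are illustrative.

```python
# A minimal sketch (not from the slides): computing Z and P(v) by enumerating
# every visible/hidden configuration of a tiny RBM.
import itertools
import numpy as np

def energy(v, h, W, a, b):
    # E(v, h) = -a^T v - b^T h - v^T W h
    return -(a @ v) - (b @ h) - (v @ W @ h)

def partition_function(W, a, b):
    # Z = sum over every (v, h) configuration of exp(-E(v, h)).
    m, n = W.shape
    return sum(
        np.exp(-energy(np.array(v), np.array(h), W, a, b))
        for v in itertools.product([0, 1], repeat=m)
        for h in itertools.product([0, 1], repeat=n)
    )

def marginal_visible(v, W, a, b):
    # P(v) = (1/Z) * sum over all hidden configurations of exp(-E(v, h)).
    n = W.shape[1]
    unnormalized = sum(
        np.exp(-energy(v, np.array(h), W, a, b))
        for h in itertools.product([0, 1], repeat=n)
    )
    return unnormalized / partition_function(W, a, b)
```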



Restricted Boltzmann Machine

The hidden unit activations are mutually independent given the visible unit activations and
vice versa.
For m visible units and n hidden units, the conditional probability of a configuration of the
visible units v, given a configuration of the hidden units h, is P(v|h) = ∏_{i=1}^{m} P(v_i|h).
Conversely, the conditional probability of h given v is P(h|v) = ∏_{j=1}^{n} P(h_j|v).

Individual activation probabilities:

P(h_j = 1|v) = σ(b_j + ∑_{i=1}^{m} w_{i,j} v_i)
P(v_i = 1|h) = σ(a_i + ∑_{j=1}^{n} w_{i,j} h_j)

where σ denotes the logistic sigmoid.
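A minimal NumPy sketch of these factorized conditionals, computing all hidden (or visible) activation probabilities at once; the helper names are illustrative.

```python
# A minimal sketch of the RBM conditional activation probabilities.
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def hidden_probs(v, W, b):
    """P(h_j = 1 | v) = sigma(b_j + sum_i w_ij v_i), for all j at once."""
    return sigmoid(b + v @ W)   # shape (n,)

def visible_probs(h, W, a):
    """P(v_i = 1 | h) = sigma(a_i + sum_j w_ij h_j), for all i at once."""
    return sigmoid(a + W @ h)   # shape (m,)
```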



Restricted Boltzmann Machine
To estimate the model statistics for the negative update, start from the data and run a
few steps of Gibbs sampling.
By the conditional independence property, all the hiddens can be sampled in parallel, and
then all the visibles can be sampled in parallel.

This procedure is called contrastive divergence.


The resulting samples give a good approximation to the model distribution.
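A minimal sketch of one such block Gibbs step, assuming binary units and the sigmoid conditionals above; all names here are illustrative.

```python
# A minimal sketch (not from the slides) of one block Gibbs step in an RBM:
# every hidden unit is sampled in parallel given v, then every visible unit
# is sampled in parallel given the new h.
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gibbs_step(v, W, a, b):
    # Sample all hidden units at once from P(h_j = 1 | v).
    h = (rng.random(W.shape[1]) < sigmoid(b + v @ W)).astype(float)
    # Sample all visible units at once from P(v_i = 1 | h).
    v_new = (rng.random(W.shape[0]) < sigmoid(a + W @ h)).astype(float)
    return v_new, h
```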



Restricted Boltzmann Machine
Contrastive Divergence (CD) algorithm steps for a single sample (a code sketch follows these steps):
For a training sample v, compute the probabilities of the hidden units and sample a hidden
activation vector h from this probability distribution.
Compute the outer product of v and h and call this the positive gradient.
From h, sample a reconstruction v’ of the visible units, then resample the hidden
activations h’ from this. (Gibbs sampling step)
Compute the outer product of v’ and h’ and call this the negative gradient.
The update to the weight matrix W is the positive gradient minus the negative gradient, times
some learning rate:
∆W = η(v h^T − v′ h′^T)

Update the biases a and b analogously


∆a = η(v − v′ )
∆b = η(h − h′ )
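A minimal sketch of a CD-1 update for a single training vector, following the steps above; the learning rate and helper names are illustrative assumptions.

```python
# A minimal sketch (not from the slides) of one CD-1 update for a single
# training vector v, with W of shape (m, n), a of shape (m,), b of shape (n,).
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_update(v, W, a, b, eta=0.1):
    # Positive phase: sample h ~ P(h | v) and form the positive gradient v h^T.
    h = (rng.random(W.shape[1]) < sigmoid(b + v @ W)).astype(float)
    pos_grad = np.outer(v, h)

    # Negative phase: reconstruct v' from h, resample h' from v' (one Gibbs
    # step), and form the negative gradient v' h'^T.
    v_prime = (rng.random(W.shape[0]) < sigmoid(a + W @ h)).astype(float)
    h_prime = (rng.random(W.shape[1]) < sigmoid(b + v_prime @ W)).astype(float)
    neg_grad = np.outer(v_prime, h_prime)

    # Parameter updates: dW = eta (v h^T - v' h'^T), da = eta (v - v'),
    # db = eta (h - h').
    W = W + eta * (pos_grad - neg_grad)
    a = a + eta * (v - v_prime)
    b = b + eta * (h - h_prime)
    return W, a, b
```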



Thank You

