The document discusses majorization techniques for optimization problems that lack closed-form solutions. It presents a tighter quadratic bound for partition functions of log-linear models that can be used within majorization methods. The bound is initialized and then updated iteratively based on the features and weights of each configuration. Finally, it notes the bound leads to faster majorization methods and can find better local maxima when applied to problems involving graphical models, high-dimensional data, or latent variables.

Optimization Partition Bound Extensions & Experiments

Majorization for CRFs and Latent Likelihoods

Tony Jebara & Anna Choromanska

Columbia University

Majorization
If the minimization $\theta^* = \arg\min_\theta C(\theta)$ has no closed-form solution,
majorization (IIS, GIS, etc.) uses a surrogate $Q$ with a closed-form minimizer
to improve monotonically from an initial $\theta_0$:
Find a bound $Q(\theta, \theta_i) \ge C(\theta)$
where $Q(\theta_i, \theta_i) = C(\theta_i)$
Update $\theta_{i+1} = \arg\min_\theta Q(\theta, \theta_i)$
Repeat
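
The loop above can be sketched on a toy problem. The cost below is a hypothetical 1-D example (softplus minus a linear term), not one taken from the slides; it is chosen only because its second derivative never exceeds 1/4, so a quadratic with curvature 1/4 is a valid surrogate that touches the cost at the current iterate.

```python
import math

CURVATURE = 0.25  # upper bound on C''(theta) = sigmoid * (1 - sigmoid)

def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))

def cost(theta, p=0.3):
    # Hypothetical cost: minimized where sigmoid(theta) == p.
    return math.log1p(math.exp(theta)) - p * theta

def grad(theta, p=0.3):
    return sigmoid(theta) - p

def surrogate(theta, theta_i, p=0.3):
    # Q(theta, theta_i): quadratic that upper-bounds the cost
    # and equals it at theta == theta_i.
    d = theta - theta_i
    return cost(theta_i, p) + grad(theta_i, p) * d + 0.5 * CURVATURE * d * d

def majorize(theta0, p=0.3, iters=50):
    theta = theta0
    history = [cost(theta, p)]
    for _ in range(iters):
        # Closed-form minimizer of the quadratic surrogate.
        theta = theta - grad(theta, p) / CURVATURE
        history.append(cost(theta, p))
    return theta, history

theta_star, history = majorize(theta0=3.0)
```

Because each surrogate lies above the cost and touches it at the current iterate, the recorded cost values in `history` never increase.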

Majorization was the preferred approach until [Wallach ’03, Andrew & Gao ’07].

For example, both IIS and GIS are slower than first-order methods.
The culprit: loose and complicated bounds.

Let’s fix this!!!


Partition Function Bound

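For log-linear model partition functions

$$Z(\theta) = \sum_y h(y) \exp\big(\theta^\top \mathbf{f}(y)\big)$$

our tighter quadratic bound is

$$\ln Z(\theta) \le \ln z + \tfrac{1}{2} (\theta - \tilde\theta)^\top \Sigma \, (\theta - \tilde\theta) + (\theta - \tilde\theta)^\top \mu$$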
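
As a quick check on what the bound parameters must be, the following worked step evaluates the right-hand side (RHS) of the bound and its gradient at $\theta = \tilde\theta$. It is a sketch under two assumptions: that the bound touches $\ln Z$ at $\tilde\theta$, as majorization requires, and that $\ln Z$ is differentiable there.

```latex
\begin{align*}
\text{RHS at } \theta = \tilde\theta: &\quad \ln z + 0 + 0 = \ln z
  \;\Rightarrow\; z = Z(\tilde\theta), \\
\nabla_\theta \text{RHS at } \theta = \tilde\theta: &\quad \mu
  \;\Rightarrow\; \mu = \nabla_\theta \ln Z(\tilde\theta)
  = \frac{1}{Z(\tilde\theta)} \sum_y h(y)
    \exp\big(\tilde\theta^\top \mathbf{f}(y)\big)\, \mathbf{f}(y).
\end{align*}
```

Under these assumptions $z$ and $\mu$ are fixed by tightness at $\tilde\theta$, and only the curvature $\Sigma$ controls how closely the bound follows $\ln Z$ away from $\tilde\theta$.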

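Init $z \to 0^+$, $\mu = 0$, $\Sigma = zI$
For each $y \in \Omega$ {
    $\alpha = h(y) \exp\big(\tilde\theta^\top \mathbf{f}(y)\big)$
    $l = \mathbf{f}(y) - \mu$
    $\Sigma \mathrel{+}= \dfrac{\tanh\big(\tfrac{1}{2}\ln(\alpha/z)\big)}{2\ln(\alpha/z)} \, l l^\top$
    $\mu \mathrel{+}= \dfrac{\alpha}{z + \alpha} \, l$
    $z \mathrel{+}= \alpha$ }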
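
The update loop above can be sketched in NumPy for a small, explicitly enumerated $\Omega$. Several details here are assumptions rather than part of the slides: the function names, the choice to approximate $z \to 0^+$ with a tiny positive constant `z0`, and the use of the limiting value 1/4 for the curvature weight when $\alpha = z$.

```python
import numpy as np

def curvature_weight(x):
    # tanh(x / 2) / (2 x), using its limit 1/4 as x -> 0.
    if abs(x) < 1e-8:
        return 0.25
    return np.tanh(0.5 * x) / (2.0 * x)

def partition_bound(theta_tilde, feats, h, z0=1e-300):
    """feats: (n, d) array whose rows are f(y); h: (n,) positive weights.

    z0 is an assumed stand-in for the limit z -> 0+.
    """
    d = feats.shape[1]
    z = z0
    mu = np.zeros(d)
    Sigma = z * np.eye(d)
    for f_y, h_y in zip(feats, h):
        alpha = h_y * np.exp(theta_tilde @ f_y)
        l = f_y - mu
        x = np.log(alpha) - np.log(z)  # ln(alpha / z)
        Sigma = Sigma + curvature_weight(x) * np.outer(l, l)
        mu = mu + alpha / (z + alpha) * l
        z = z + alpha
    return z, mu, Sigma

def log_partition(theta, feats, h):
    # Exact ln Z(theta) by enumeration, for comparison only.
    return np.log(np.sum(h * np.exp(feats @ theta)))

def bound_value(theta, theta_tilde, z, mu, Sigma):
    d = theta - theta_tilde
    return np.log(z) + 0.5 * d @ Sigma @ d + d @ mu

# Hypothetical toy model: 6 configurations with 3 features each.
rng = np.random.default_rng(0)
feats = rng.normal(size=(6, 3))
h = np.ones(6)
theta_tilde = rng.normal(size=3)
z, mu, Sigma = partition_bound(theta_tilde, feats, h)
```

In this sketch `z` accumulates $Z(\tilde\theta)$ and `mu` is a running weighted mean of the feature vectors, so the bound touches $\ln Z$ at $\tilde\theta$; `bound_value` can then be compared against `log_partition` at other values of $\theta$.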

Extensions & Experiments

Graphical models: use efficient message-passing of bounds.
High dimensional models: use fast low-rank bounds.

Latent models: bounds find better local maxima, faster.
