Professional Documents
Culture Documents
Analysis of Fishing Data - Application of Count Regression
Analysis of Fishing Data - Application of Count Regression
Analysis of Fishing Data - Application of Count Regression
• TO
TEST WHETHER A FIT IS GOOD OR NOT, PEARSON'S CHI-SQUARED
STATISTIC CAN BE USED
• , ASYMPTOTICALLY (PROVIDED THE FIT IS GOOD)
• AS
A MEASURE OF RELATIVE QUALITY OF TWO STATISTICAL MODELS FOR A
GIVEN A GIVEN SET OF DATA, A. I. C. CAN BE USED
• BEING THE MOST COMMON COUNT MODEL, POISSON MODEL CAN BE USED
PRIMARILY, PROVIDED MEAN AND VARIANCE OF NUMBER OF FISHES CAUGHT
ARE APPROXIMATELY EQUAL
• IF MEAN IS SIGNIFICANTLY LESS THAN VARIANCE, NEGATIVE BINOMIAL COULD
BE AN APPROPRIATE MODEL
• IF ZEROES (EITHER ALL OR SOME) OCCUR DUE TO SOME INDEPENDENT
PROCESS FROM THE COUNT PROCESS, HURDLE OR ZERO-INFLATED MODELS
CAN GIVE BETTER RESULTS
INITIAL APPROACHES
DEFINING DESCRIPTIVE
VARIABLES MEASURES COLUMN DIAGRAM
OBSERVATIONS
• P. M. F. OF DISTRIBUTION -
• M. L. E. OF IS THE SAMPLE MEAN
• , AN EXTREMELY UNLIKELY VALUE
• POISSON MODEL PROVIDES A POOR FIT FOR FISHING DATA
NEGATIVE BINOMIAL FITTING
• P. M. F. OF DISTRIBUTION -
• M. L. E. OF IS THE SAMPLE MEAN
• SOLVING SCORE EQUATION , M. L. E. OF IS 5.438.
• , WHICH IS A LIKELY VALUE AT 1% LEVEL OF SIGNIFICANCE
• NEGATIVE BINOMIAL PROVIDES A SATISFACTORY FIT FOR OUR DATA
NEGATIVE BINOMIAL REGRESSION
• WE
PROCEED TO FIT A REGRESSION MODEL WITH LOGARITHMIC LINK
FUNCTION
• MODEL :
• ARE UNKNOWN PARAMETERS, IS THE DISTURBANCE TERM
• WE ESTIMATE BY THE METHOD OF MAXIMUM LIKELIHOOD
• FITTED MODEL :
ZERO-INFLATED POISSON DISTRIBUTION
• M.
L. E. OF IS OBTAINED AS 7.628 BY SOLVING SCORE EQUATION
• M. L. E. OF IS OBTAINED AS 0.568 FROM THE EQUATION
• BASED ON THESE ESTIMATES, WE PROCEED TO FIT A REGRESSION MODEL
• WITH LOGIT LINK FOR BINARY PROCESS AND LOGARITHMIC LINK FOR COUNT PROCESS
• FITTED MODEL
COMPARISON OF THE TWO REGRESSION
MODELS
• LOG-LIKELIHOOD OF NEGATIVE BINOMIAL MODEL COMES OUT TO BE
(-)405.222
• AND THAT OF ZERO-INFLATED POISSON MODEL IS (-)752.132
• THUS A. I. C. OF THE FORMER MODEL IS APPROXIMATELY 820
• AND THAT OF THE LATTER BECOMES MORE OR LESS 1521
• NOTE THAT, INCLUSION OF ZERO-INFLATION WORSENS EFFICACY OF THE
REGRESSION MODEL
• THUS WE CHOOSE NEGATIVE BINOMIAL AS A PREFERABLE MODEL OVER THE
ALTERNATIVE
INSIGHT INTO THE FIT
SIGNIFICANCE OF THE COEFFICIENTS
• TO
TEST WHETHER ALL THE FACTORS AFFECT THE RESPONSE, WE WISH TO
TEST
• APPROPRIATE TEST STATISTIC FOR EACH TEST ARE THE STANDARDIZED
ESTIMATES OF THAT PARAMETER
• WE OBTAIN THE VALUES OF THE FOUR TEST STATISTICS AS 9.273, (-)9.623,
(-)4.918 AND (-)3.362
• AND THE CRITICAL POINT IS OBTAINED AS 1.960, AT 5% LEVEL OF
SIGNIFICANCE
• HENCE ALL THE COEFFICIENTS CAN BE CONSIDERED TO BE SIGNIFICANTLY
NONZERO
INTERPRETATION OF THE COEFFICIENTS
• TO
UNDERSTAND HOW THE NUMBER OF FISHES CAUGHT IS AFFECTED BY
THE GROUP COMPOSITION, WE NEED TO UNDERSTAND THE IMPLICATIONS OF
THE VALUES OF THE ESTIMATES OF THE COEFFICIENTS
• AND CAN BE INTERPRETED AS THE INCREMENT IN THE LOGARITHM OF THE
AVERAGE NUMBER OF FISHES CAUGHT DUE TO UNIT INCREASE IN NUMBER
OF PERSONS AND CHILDREN IN A GROUP, WHEN OTHERS ARE HELD FIXED
• THUS PRESENCE OF AN ADDITIONAL PERSON IN THE GROUP WILL INCREASE
EXPECTED RESPONSE BY 189%, WHEREAS AN ADDITIONAL CHILD WILL
DECREASE IT BY 83%.
… CONTINUED