Download as pdf or txt
Download as pdf or txt
You are on page 1of 53

Python Machine Learning Case

Studies: Five Case Studies for the Data


Scientist 1st Edition Danish Haroon
Visit to download the full and correct content document:
https://textbookfull.com/product/python-machine-learning-case-studies-five-case-studi
es-for-the-data-scientist-1st-edition-danish-haroon/
More products digital (pdf, epub, mobi) instant
download maybe you interests ...

Python Machine Learning Case Studies: Five Case Studies


for the Data Scientist 1st Edition Danish Haroon

https://textbookfull.com/product/python-machine-learning-case-
studies-five-case-studies-for-the-data-scientist-1st-edition-
danish-haroon-2/

The Family Nurse Practitioner: Clinical Case Studies


(Case Studies in Nursing) Leslie Neal-Boylan

https://textbookfull.com/product/the-family-nurse-practitioner-
clinical-case-studies-case-studies-in-nursing-leslie-neal-boylan/

Transit Oriented Development Learning from


International Case Studies Ren Thomas

https://textbookfull.com/product/transit-oriented-development-
learning-from-international-case-studies-ren-thomas/

International Case Studies in Event Management


(Routledge International Case Studies in Tourism) 1st
Edition Edited By Judith Mair

https://textbookfull.com/product/international-case-studies-in-
event-management-routledge-international-case-studies-in-
tourism-1st-edition-edited-by-judith-mair/
Applied Analytics through Case Studies Using SAS and R:
Implementing Predictive Models and Machine Learning
Techniques Deepti Gupta

https://textbookfull.com/product/applied-analytics-through-case-
studies-using-sas-and-r-implementing-predictive-models-and-
machine-learning-techniques-deepti-gupta/

Project Management Case Studies Harold Kerzner

https://textbookfull.com/product/project-management-case-studies-
harold-kerzner/

Practical Natural Language Processing with Python With


Case Studies from Industries Using Text Data at Scale
1st Edition Mathangi Sri

https://textbookfull.com/product/practical-natural-language-
processing-with-python-with-case-studies-from-industries-using-
text-data-at-scale-1st-edition-mathangi-sri/

Case Studies in Forensic Psychology Ruth Tully

https://textbookfull.com/product/case-studies-in-forensic-
psychology-ruth-tully/

Case Studies in Building Rehabilitation J.M.P.Q.


Delgado

https://textbookfull.com/product/case-studies-in-building-
rehabilitation-j-m-p-q-delgado/
Python
Machine Learning
Case Studies
Five Case Studies for the Data Scientist

Danish Haroon
Python Machine
Learning Case
Studies
Five Case Studies for the
Data Scientist

Danish Haroon
Python Machine Learning Case Studies
Danish Haroon
Karachi, Pakistan
ISBN-13 (pbk): 978-1-4842-2822-7 ISBN-13 (electronic): 978-1-4842-2823-4
DOI 10.1007/978-1-4842-2823-4
Library of Congress Control Number: 2017957234
Copyright © 2017 by Danish Haroon
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole
or part of the material is concerned, specifically the rights of translation, reprinting, reuse of
illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical
way, and transmission or information storage and retrieval, electronic adaptation, computer
software, or by similar or dissimilar methodology now known or hereafter developed.
Trademarked names, logos, and images may appear in this book. Rather than use a trademark
symbol with every occurrence of a trademarked name, logo, or image we use the names, logos,
and images only in an editorial fashion and to the benefit of the trademark owner, with no
intention of infringement of the trademark.
The use in this publication of trade names, trademarks, service marks, and similar terms, even if
they are not identified as such, is not to be taken as an expression of opinion as to whether or not
they are subject to proprietary rights.
While the advice and information in this book are believed to be true and accurate at the
date of publication, neither the authors nor the editors nor the publisher can accept any legal
responsibility for any errors or omissions that may be made. The publisher makes no warranty,
express or implied, with respect to the material contained herein.
Cover image by Freepik (www.freepik.com)
Managing Director: Welmoed Spahr
Editorial Director: Todd Green
Acquisitions Editor: Celestin Suresh John
Development Editor: Matthew Moodie
Technical Reviewer: Somil Asthana
Coordinating Editor: Sanchita Mandal
Copy Editor: Lori Jacobs
Compositor: SPi Global
Indexer: SPi Global
Artist: SPi Global
Distributed to the book trade worldwide by Springer Science+Business Media New York,
233 Spring Street, 6th Floor, New York, NY 10013. Phone 1-800-SPRINGER, fax (201) 348-4505,
e-mail orders-ny@springer-sbm.com, or visit www.springeronline.com. Apress Media, LLC is
a California LLC and the sole member (owner) is Springer Science + Business Media Finance Inc
(SSBM Finance Inc). SSBM Finance Inc is a Delaware corporation.
For information on translations, please e-mail rights@apress.com, or visit
http://www.apress.com/rights-permissions.
Apress titles may be purchased in bulk for academic, corporate, or promotional use. eBook
versions and licenses are also available for most titles. For more information, reference our Print
and eBook Bulk Sales web page at http://www.apress.com/bulk-sales.
Any source code or other supplementary material referenced by the author in this book is available
to readers on GitHub via the book’s product page, located at www.apress.com/978-1-4842-2822-7.
For more detailed information, please visit http://www.apress.com/source-code.
Printed on acid-free paper
Contents at a Glance

About the Author������������������������������������������������������������������������������ xi


About the Technical Reviewer�������������������������������������������������������� xiii
Acknowledgments��������������������������������������������������������������������������� xv
Introduction����������������������������������������������������������������������������������� xvii


■Chapter 1: Statistics and Probability���������������������������������������������� 1

■Chapter 2: Regression������������������������������������������������������������������ 45

■Chapter 3: Time Series����������������������������������������������������������������� 95

■Chapter 4: Clustering������������������������������������������������������������������ 129

■Chapter 5: Classification������������������������������������������������������������ 161

■Appendix A: Chart types and when to use them������������������������� 197

Index���������������������������������������������������������������������������������������������� 201

iii
Contents

About the Author������������������������������������������������������������������������������ xi


About the Technical Reviewer�������������������������������������������������������� xiii
Acknowledgments��������������������������������������������������������������������������� xv
Introduction����������������������������������������������������������������������������������� xvii


■Chapter 1: Statistics and Probability���������������������������������������������� 1
Case Study: Cycle Sharing Scheme—Determining Brand Persona�������� 1
Performing Exploratory Data Analysis����������������������������������������������������� 4
Feature Exploration�������������������������������������������������������������������������������������������������� 4
Types of variables����������������������������������������������������������������������������������������������������� 6
Univariate Analysis��������������������������������������������������������������������������������������������������� 9
Multivariate Analysis���������������������������������������������������������������������������������������������� 14
Time Series Components���������������������������������������������������������������������������������������� 18

Measuring Center of Measure��������������������������������������������������������������� 20


Mean����������������������������������������������������������������������������������������������������������������������� 20
Median�������������������������������������������������������������������������������������������������������������������� 22
Mode����������������������������������������������������������������������������������������������������������������������� 22
Variance������������������������������������������������������������������������������������������������������������������ 22
Standard Deviation������������������������������������������������������������������������������������������������� 23
Changes in Measure of Center Statistics due to Presence of Constants���������������� 23
The Normal Distribution������������������������������������������������������������������������������������������ 25

v
 ■ Contents

Correlation��������������������������������������������������������������������������������������������� 34
Pearson R Correlation��������������������������������������������������������������������������������������������� 34
Kendall Rank Correlation���������������������������������������������������������������������������������������� 34
Spearman Rank Correlation������������������������������������������������������������������������������������ 35

Hypothesis Testing: Comparing Two Groups������������������������������������������ 37


t-Statistics�������������������������������������������������������������������������������������������������������������� 37
t-Distributions and Sample Size����������������������������������������������������������������������������� 38

Central Limit Theorem��������������������������������������������������������������������������� 40


Case Study Findings������������������������������������������������������������������������������ 41
Applications of Statistics and Probability���������������������������������������������� 42
Actuarial Science���������������������������������������������������������������������������������������������������� 42
Biostatistics������������������������������������������������������������������������������������������������������������ 42
Astrostatistics��������������������������������������������������������������������������������������������������������� 42
Business Analytics�������������������������������������������������������������������������������������������������� 42
Econometrics���������������������������������������������������������������������������������������������������������� 43
Machine Learning��������������������������������������������������������������������������������������������������� 43
Statistical Signal Processing���������������������������������������������������������������������������������� 43
Elections����������������������������������������������������������������������������������������������������������������� 43


■Chapter 2: Regression������������������������������������������������������������������ 45
Case Study: Removing Inconsistencies in Concrete
Compressive Strength��������������������������������������������������������������������������� 45
Concepts of Regression������������������������������������������������������������������������ 48
Interpolation and Extrapolation������������������������������������������������������������������������������ 48
Linear Regression��������������������������������������������������������������������������������������������������� 49
Least Squares Regression Line of y on x���������������������������������������������������������������� 50
Multiple Regression������������������������������������������������������������������������������������������������ 51
Stepwise Regression���������������������������������������������������������������������������������������������� 52
Polynomial Regression������������������������������������������������������������������������������������������� 53

vi
 ■ Contents

Assumptions of Regressions����������������������������������������������������������������� 54
Number of Cases���������������������������������������������������������������������������������������������������� 55
Missing Data����������������������������������������������������������������������������������������������������������� 55
Multicollinearity and Singularity����������������������������������������������������������������������������� 55

Features’ Exploration���������������������������������������������������������������������������� 56
Correlation�������������������������������������������������������������������������������������������������������������� 58

Overfitting and Underfitting������������������������������������������������������������������� 64


Regression Metrics of Evaluation���������������������������������������������������������� 67
Explained Variance Score��������������������������������������������������������������������������������������� 68
Mean Absolute Error����������������������������������������������������������������������������������������������� 68
Mean Squared Error����������������������������������������������������������������������������������������������� 68
R2���������������������������������������������������������������������������������������������������������������������������� 69
Residual������������������������������������������������������������������������������������������������������������������ 69
Residual Plot����������������������������������������������������������������������������������������������������������� 70
Residual Sum of Squares��������������������������������������������������������������������������������������� 70

Types of Regression������������������������������������������������������������������������������ 70
Linear Regression��������������������������������������������������������������������������������������������������� 71
Grid Search������������������������������������������������������������������������������������������������������������� 75
Ridge Regression���������������������������������������������������������������������������������������������������� 75
Lasso Regression��������������������������������������������������������������������������������������������������� 79
ElasticNet��������������������������������������������������������������������������������������������������������������� 81
Gradient Boosting Regression�������������������������������������������������������������������������������� 82
Support Vector Machines���������������������������������������������������������������������������������������� 86

Applications of Regression�������������������������������������������������������������������� 89
Predicting Sales������������������������������������������������������������������������������������������������������ 89
Predicting Value of Bond����������������������������������������������������������������������������������������� 90
Rate of Inflation������������������������������������������������������������������������������������������������������ 90
Insurance Companies��������������������������������������������������������������������������������������������� 91
Call Center�������������������������������������������������������������������������������������������������������������� 91

vii
 ■ Contents

Agriculture�������������������������������������������������������������������������������������������������������������� 91
Predicting Salary���������������������������������������������������������������������������������������������������� 91
Real Estate Industry����������������������������������������������������������������������������������������������� 92


■Chapter 3: Time Series����������������������������������������������������������������� 95
Case Study: Predicting Daily Adjusted Closing Rate of Yahoo��������������� 95
Feature Exploration������������������������������������������������������������������������������� 97
Time Series Modeling��������������������������������������������������������������������������������������������� 98

Evaluating the Stationary Nature of a Time Series Object��������������������� 98


Properties of a Time Series Which Is Stationary in Nature������������������������������������� 99
Tests to Determine If a Time Series Is Stationary��������������������������������������������������� 99
Methods of Making a Time Series Object Stationary�������������������������������������������� 102

Tests to Determine If a Time Series Has Autocorrelation�������������������� 113


Autocorrelation Function�������������������������������������������������������������������������������������� 113
Partial Autocorrelation Function��������������������������������������������������������������������������� 114
Measuring Autocorrelation����������������������������������������������������������������������������������� 114

Modeling a Time Series����������������������������������������������������������������������� 115


Tests to Validate Forecasted Series���������������������������������������������������������������������� 116
Deciding Upon the Parameters for Modeling�������������������������������������������������������� 116

Auto-Regressive Integrated Moving Averages������������������������������������ 119


Auto-Regressive Moving Averages����������������������������������������������������������������������� 119
Auto-Regressive��������������������������������������������������������������������������������������������������� 120
Moving Average���������������������������������������������������������������������������������������������������� 121
Combined Model��������������������������������������������������������������������������������������������������� 122

Scaling Back the Forecast������������������������������������������������������������������� 123


Applications of Time Series Analysis��������������������������������������������������� 127
Sales Forecasting������������������������������������������������������������������������������������������������� 127
Weather Forecasting��������������������������������������������������������������������������������������������� 127
Unemployment Estimates������������������������������������������������������������������������������������� 127

viii
 ■ Contents

Disease Outbreak������������������������������������������������������������������������������������������������� 128


Stock Market Prediction��������������������������������������������������������������������������������������� 128


■Chapter 4: Clustering������������������������������������������������������������������ 129
Case Study: Determination of Short Tail Keywords for Marketing������� 129
Features’ Exploration�������������������������������������������������������������������������� 131
Supervised vs. Unsupervised Learning����������������������������������������������� 133
Supervised Learning��������������������������������������������������������������������������������������������� 133
Unsupervised Learning����������������������������������������������������������������������������������������� 133

Clustering�������������������������������������������������������������������������������������������� 134
Data Transformation for Modeling������������������������������������������������������� 135
Metrics of Evaluating Clustering Models�������������������������������������������������������������� 137

Clustering Models������������������������������������������������������������������������������� 137


k-Means Clustering���������������������������������������������������������������������������������������������� 137
Applying k-Means Clustering for Optimal Number of Clusters����������������������������� 143
Principle Component Analysis������������������������������������������������������������������������������ 144
Gaussian Mixture Model��������������������������������������������������������������������������������������� 151
Bayesian Gaussian Mixture Model������������������������������������������������������������������������ 156

Applications of Clustering������������������������������������������������������������������� 159


Identifying Diseases��������������������������������������������������������������������������������������������� 159
Document Clustering in Search Engines�������������������������������������������������������������� 159
Demographic-Based Customer Segmentation����������������������������������������������������� 159


■Chapter 5: Classification������������������������������������������������������������ 161
Case Study: Ohio Clinic—Meeting Supply and Demand��������������������� 161
Features’ Exploration�������������������������������������������������������������������������� 164
Performing Data Wrangling����������������������������������������������������������������� 168
Performing Exploratory Data Analysis������������������������������������������������� 172
Features’ Generation��������������������������������������������������������������������������� 178

ix
 ■ Contents

Classification��������������������������������������������������������������������������������������� 180
Model Evaluation Techniques������������������������������������������������������������������������������� 181
Ensuring Cross-Validation by Splitting the Dataset���������������������������������������������� 184
Decision Tree Classification���������������������������������������������������������������������������������� 185

Kernel Approximation�������������������������������������������������������������������������� 186


SGD Classifier������������������������������������������������������������������������������������������������������� 187
Ensemble Methods����������������������������������������������������������������������������������������������� 189

Random Forest Classification�������������������������������������������������������������� 190


Gradient Boosting������������������������������������������������������������������������������������������������� 193

Applications of Classification�������������������������������������������������������������� 195


Image Classification��������������������������������������������������������������������������������������������� 196
Music Classification���������������������������������������������������������������������������������������������� 196
E-mail Spam Filtering������������������������������������������������������������������������������������������� 196
Insurance�������������������������������������������������������������������������������������������������������������� 196


■Appendix A: Chart types and when to use them������������������������� 197
Pie chart���������������������������������������������������������������������������������������������� 197
Bar graph�������������������������������������������������������������������������������������������� 198
Histogram�������������������������������������������������������������������������������������������� 198
Stem and Leaf plot������������������������������������������������������������������������������ 199
Box plot����������������������������������������������������������������������������������������������� 199

Index���������������������������������������������������������������������������������������������� 201

x
About the Author

Danish Haroon currently leads the Data Sciences


team at Market IQ Inc, a patented predictive analytics
platform focused on providing actionable, real-time
intelligence, culled from sentiment inflection points.
He received his MBA from Karachi School for Business
and Leadership, having served corporate clients and
their data analytics requirements. Most recently, he
led the data commercialization team at PredictifyME,
a startup focused on providing predictive analytics for
demand planning and real estate markets in the US
market. His current research focuses on the amalgam of
data sciences for improved customer experiences (CX).

xi
About the Technical
Reviewer

Somil Asthana has a BTech from IITBHU India and


a MS from the University of New York at Buffalo (in
the United States) both in Computer Science. He is an
entrepreneur, machine learning wizard, and BigData
specialist consulting with fortune 500 companies like
Sprint, Verizon , HPE, and Avaya. He has a startup
which provides BigData solutions and Data Strategies
to Data Driven Industries in ecommerce, content/
media domain.

xiii
Acknowledgments

I would like to thank my parents and lovely wife for their continuous support throughout
this enlightening journey.

xv
Introduction

This volume embraces machine learning approaches and Python to enable automatic
rendering of rich insights and solutions to business problems. The book uses a
hands-on case study-based approach to crack real-world applications where machine
learning concepts can provide a best fit. These smarter machines will enable your
business processes to achieve efficiencies in minimal time and resources.
Python Machine Learning Case Studies walks you through a step-by-step approach to
improve business processes and help you discover the pivotal points that frame corporate
strategies. You will read about machine learning techniques that can provide support to
your products and services. The book also highlights the pros and cons of each of these
machine learning concepts to help you decide which one best suits your needs.
By taking a step-by-step approach to coding you will be able to understand the
rationale behind model selection within the machine learning process. The book is
equipped with practical examples and code snippets to ensure that you understand the
data science approach for solving real-world problems.
Python Machine Leaarning Case Studies acts as an enabler for people from both
technical and non-technical backgrounds to apply machine learning techniques to
real-world problems. Each chapter starts with a case study that has a well-defined
business problem. The chapters then proceed by incorporating storylines, and code
snippets to decide on the most optimal solution. Exercises are laid out throughout the
chapters to enable the hands-on practice of the concepts learned. Each chapter ends
with a highlight of real-world applications to which the concepts learned can be applied.
Following is a brief overview of the contents covered in each of the five chapters:
Chapter 1 covers the concepts of statistics and probability.
Chapter 2 talks about regression techniques and methods to fine-tune the model.
Chapter 3 exposes readers to time series models and covers the property of
stationary in detail.
Chapter 4 uses clustering as an aid to segment the data for marketing purposes.
Chapter 5 talks about classification models and evaluation metrics to gauge the
goodness of these models.

xvii
CHAPTER 1

Statistics and Probability

The purpose of this chapter is to instill in you the basic concepts of traditional statistics
and probability. Certainly many of you might be wondering what it has to do with
machine learning. Well, in order to apply a best fit model to your data, the most important
prerequisite is for you to understand the data in the first place. This will enable you to find
out distributions within data, measure the goodness of data, and run some basic tests
to understand if some form of relationship exists between dependant and independent
variables. Let’s dive in.

■■Note This book incorporates Python 2.7.11 as the de facto standard for coding
examples. Moreover, you are required to have it installed it for the Exercises as well.

So why do I prefer Python 2.7.11 over Python 3x? Following are some of the reasons:
• Third-party library support for Python 2x is relatively better than
support for Python 3x. This means that there are a considerable
number of libraries in Python 2x that lack support in Python 3x.
• Some current Linux distributions and macOS provide Python 2x
by default. The objective is to let readers, regardless of their OS
version, apply the code examples on their systems, and thus this
is the choice to go forward with.
• The above-mentioned facts are the reason why companies prefer
to work with Python 2x or why they decide not to migrate their
code base from Python 2x to Python 3x.

Case Study: Cycle Sharing


Scheme—Determining Brand Persona
Nancy and Eric were assigned with the huge task of determining the brand persona
for a new cycle share scheme. They had to present their results at this year’s annual
board meeting in order to lay out a strong marketing plan for reaching out to
potential customers.

© Danish Haroon 2017 1


D. Haroon, Python Machine Learning Case Studies, DOI 10.1007/978-1-4842-2823-4_1
Chapter 1 ■ Statistics and Probability

The cycle sharing scheme provides means for the people of the city to commute
using a convenient, cheap, and green transportation alternative. The service has 500
bikes at 50 stations across Seattle. Each of the stations has a dock locking system (where
all bikes are parked); kiosks (so customers can get a membership key or pay for a trip);
and a helmet rental service. A person can choose between purchasing a membership
key or short-term pass. A membership key entitles an annual membership, and the key
can be obtained from a kiosk. Advantages for members include quick retrieval of bikes
and unlimited 45-minute rentals. Short-term passes offer access to bikes for a 24-hour
or 3-day time interval. Riders can avail and return the bikes at any of the 50 stations
citywide.
Jason started this service in May 2014 and since then had been focusing on
increasing the number of bikes as well as docking stations in order to increase
convenience and accessibility for his customers. Despite this expansion, customer
retention remained an issue. As Jason recalled, “We had planned to put in the investment
for a year to lay out the infrastructure necessary for the customers to start using it. We
had a strategy to make sure that the retention levels remain high to make this model self-
sustainable. However, it worked otherwise (i.e., the customer base didn’t catch up with
the rate of the infrastructure expansion).”
A private service would have had three alternatives to curb this problem: get
sponsors on board, increase service charges, or expand the pool of customers. Price hikes
were not an option for Jason as this was a publicly sponsored initiative with the goal of
providing affordable transportation to all. As for increasing the customer base, they had
to decide upon a marketing channel that guarantees broad reach on low cost incurred.
Nancy, a marketer who had worked in the corporate sector for ten years, and Eric, a
data analyst, were explicitly hired to find a way to make things work around this problem.
The advantage on their side was that they were provided with the dataset of transaction
history and thus they didn’t had to go through the hassle of conducting marketing
research to gather data.
Nancy realized that attracting recurring customers on a minimal budget
required understanding the customers in the first place (i.e., persona). As she stated,
“Understanding the persona of your brand is essential, as it helps you reach a targeted
audience which is likely to convert at a higher probability. Moreover, this also helps in
reaching out to sponsors who target a similar persona. This two-fold approach can make
our bottom line positive.”
As Nancy and Eric contemplated the problem at hand, they had questions like the
following: Which attribute correlates the best with trip duration and number of trips?
Which age generation adapts the most to our service?
Following is the data dictionary of the Trips dataset that was provided to Nancy and
Eric:

2
Chapter 1 ■ Statistics and Probability

Table 1-1. Data Dictionary for the Trips Data from Cycles Share Dataset

Feature name Description


trip_id Unique ID assigned to each trip
Starttime Day and time when the trip started, in PST
Stoptime Day and time when the trip ended, in PST
Bikeid ID attached to each bike
Tripduration Time of trip in seconds
from_station_name Name of station where the trip originated
to_station_name Name of station where the trip terminated
from_station_id ID of station where trip originated
to_station_id ID of station where trip terminated
Usertype Value can include either of the following: short-term pass
holder or member
Gender Gender of the rider
Birthyear Birth year of the rider

Exercises for this chapter required Eric to install the packages shown in Listing 1-1.
He preferred to import all of them upfront to avoid bottlenecks while implementing the
code snippets on your local machine.
However, for Eric to import these packages in his code, he needed to install them in
the first place. He did so as follows:
1. Opened terminal/shell
2. Navigated to his code directory using terminal/shell
3. Installed pip:

python get-pip.py

4. Installed each package separately, for example:

pip install pandas

Listing 1-1. Importing Packages Required for This Chapter


%matplotlib inline

import random
import datetime
import pandas as pd
import matplotlib.pyplot as plt
import statistics

3
Chapter 1 ■ Statistics and Probability

import numpy as np
import scipy
from scipy import stats
import seaborn

Performing Exploratory Data Analysis


Eric recalled to have explained Exploratory Data Analysis in the following words:

What do I mean by exploratory data analysis (EDA)? Well, by this I


mean to see the data visually. Why do we need to see the data visually?
Well, considering that you have 1 million observations in your dataset
then it won’t be easy for you to understand the data just by looking at it,
so it would be better to plot it visually. But don’t you think it’s a waste of
time? No not at all, because understanding the data lets us understand
the importance of features and their limitations.

Feature Exploration
Eric started off by loading the data into memory (see Listing 1-2).

Listing 1-2. Reading the Data into Memory


data = pd.read_csv('examples/trip.csv')

Nancy was curious to know how big the data was and what it looked like. Hence, Eric
wrote the code in Listing 1-3 to print some initial observations of the dataset to get a feel
of what it contains.

Listing 1-3. Printing Size of the Dataset and Printing First Few Rows
print len(data)
data.head()

Output

236065

4
Chapter 1 ■ Statistics and Probability

Table 1-2. Print of Observations in the First Seven Columns of Dataset

trip_id starttime stoptime bikeid tripduration from_station_name to_station_name

Occidental Park/
10/13/2014 10/13/2014
431 SEA00298 985.935 2nd Ave & Spring St Occidental Ave S
10:31 10:48
& S Washing...

Occidental Park/
10/13/2014 10/13/2014
432 SEA00195 926.375 2nd Ave & Spring St Occidental Ave S
10:32 10:48
& S Washing...

Occidental Park/
10/13/2014 10/13/2014
433 SEA00486 883.831 2nd Ave & Spring St Occidental Ave S
10:33 10:48
& S Washing...

Occidental Park/
10/13/2014 10/13/2014
434 SEA00333 865.937 2nd Ave & Spring St Occidental Ave S
10:34 10:48
& S Washing...

Occidental Park/
10/13/2014 10/13/2014
435 SEA00202 923.923 2nd Ave & Spring St Occidental Ave S
10:34 10:49
& S Washing...

Table 1-3. Print of Observations in the Last five Columns of Dataset

from_station_id to_station_id usertype gender birthyear

CBD-06 PS-04 Member Male 1960.0

CBD-06 PS-04 Member Male 1970.0

CBD-06 PS-04 Member Female 1988.0

CBD-06 PS-04 Member Female 1977.0

CBD-06 PS-04 Member Male 1971.0

5
Chapter 1 ■ Statistics and Probability

After looking at Table 1-2 and Table 1-3 Nancy noticed that tripduration is
represented in seconds. Moreover, the unique identifiers for bike, from_station, and
to_station are in the form of strings, contrary to those for trip identifier which are in
the form of integers.

Types of variables
Nancy decided to go an extra mile and allocated data type to each feature in the dataset.

Table 1-4. Nancy’s Approach to Classifying Variables into Data Types

Feature name Variable type


trip_id Numbers
bikeid
tripduration
from_station_id
to_station_id
birthyear
Starttime Date
Stoptime
from_station_name to_station_name Text
Usertype
Gender

After looking at the feature classification in Table 1-4 Eric noticed that Nancy had
correctly identified the data types and thus it seemed to be an easy job for him to explain
what variable types mean. As Eric recalled to have explained the following:

In normal everyday interaction with data we usually represent numbers


as integers, text as strings, True/False as Boolean, etc. These are what
we refer to as data types. But the lingo in machine learning is a bit more
granular, as it splits the data types we knew earlier into variable types.
Understanding these variable types is crucial in deciding upon the type
of charts while doing exploratory data analysis or while deciding upon a
suitable machine learning algorithm to be applied on our data.

Continuous/Quantitative Variables
A continuous variable can have an infinite number of values within a given range. Unlike
discrete variables, they are not countable. Before exploring the types of continuous
variables, let’s understand what is meant by a true zero point.

6
Chapter 1 ■ Statistics and Probability

True Zero Point


If a level of measurement has a true zero point, then a value of 0 means you have nothing.
Take, for example, a ratio variable which represents the number of cupcakes bought. A
value of 0 will signify that you didn’t buy even a single cupcake. The true zero point is a
strong discriminator between interval and ratio variables.
Let’s now explore the different types of continuous variables.

Interval Variables
Interval variables exist around data which is continuous in nature and has a numerical
value. Take, for example, the temperature of a neighborhood measured on a daily basis.
Difference between intervals remains constant, such that the difference between 70
Celsius and 50 Celsius is the same as the difference between 80 Celsius and 100 Celsius.
We can compute the mean and median of interval variables however they don’t have a
true zero point.

Ratio Variables
Properties of interval variables are very similar to those of ratio variables with the
difference that in ratio variables a 0 indicates the absence of that measurement. Take,
for example, distance covered by cars from a certain neighborhood. Temperature in
Celsius is an interval variable, so having a value of 0 Celsius does not mean absence of
temperature. However, notice that a value of 0 KM will depict no distance covered by the
car and thus is considered as a ratio variable. Moreover, as evident from the name, ratios
of measurements can be used as well such that a distance covered of 50 KM is twice the
distance of 25 KM covered by a car.

Discrete Variables
A discrete variable will have finite set of values within a given range. Unlike continuous
variables those are countable. Let’s look at some examples of discrete variables which are
categorical in nature.

Ordinal Variables
Ordinal variables have values that are in an order from lowest to highest or vice versa.
These levels within ordinal variables can have unequal spacing between them. Take, for
example, the following levels:
1. Primary school
2. High school
3. College
4. University

7
Chapter 1 ■ Statistics and Probability

The difference between primary school and high school in years is definitely not
equal to the difference between high school and college. If these differences were
constant, then this variable would have also qualified as an interval variable.

Nominal Variables
Nominal variables are categorical variables with no intrinsic order; however, constant
differences between the levels exist. Examples of nominal variables can be gender, month
of the year, cars released by a manufacturer, and so on. In the case of month of year, each
month is a different level.

Dichotomous Variables
Dichotomous variables are nominal variables which have only two categories or levels.
Examples include
• Age: under 24 years, above 24 years
• Gender: male, female

Lurking Variable
A lurking variable is not among exploratory (i.e., independent) or response
(i.e., dependent) variables and yet may influence the interpretations of relationship
among these variables. For example, if we want to predict whether or not an applicant
will get admission in a college on the basis of his/her gender. A possible lurking variable
in this case can be the name of the department the applicant is seeking admission to.

Demographic Variable
Demography (from the Greek word meaning “description of people”) is the study of
human populations. The discipline examines size and composition of populations as well
as the movement of people from locale to locale. Demographers also analyze the effects
of population growth and its control. A demographic variable is a variable that is collected
by researchers to describe the nature and distribution of the sample used with inferential
statistics. Within applied statistics and research, these are variables such as age, gender,
ethnicity, socioeconomic measures, and group membership.

Dependent and Independent Variables


An independent variable is also referred to as an exploratory variable because it is being
used to explain or predict the dependent variable, also referred to as a response variable
or outcome variable.
Taking the dataset into consideration, what are the dependent and independent
variables? Let’s say that Cycle Share System’s management approaches you and asks
you to build a system for them to predict the trip duration beforehand so that the supply

8
Chapter 1 ■ Statistics and Probability

of cycles can be ensured. In that case, what is your dependent variable? Definitely
tripduration. And what are the independent variables? Well, these variables will comprise
of the features which we believe influence the dependent variable (e.g., usertype, gender,
and time and date of the day).
Eric asked Nancy to classify the features in the variable types he had just explained.

Table 1-5. Nancy’s Approach to Classifying Variables into Variable Types

Feature name Variable type


trip_id Continuous
bikeid
tripduration
from_station_id
to_station_id
birthyear
Starttime DateTime
Stoptime
from_station_name String
to_station_name
Usertype gender Nominal

Nancy now had a clear idea of the variable types within machine learning, and also
which of the features qualify for which of those variable types (see Table 1-5). However
despite of looking at the initial observations of each of these features (see Table 1-2) she
couldn’t deduce the depth and breadth of information that each of those tables contains.
She mentioned this to Eric, and Eric, being a data analytics guru, had an answer: perform
univariate analysis on features within the dataset.

Univariate Analysis
Univariate comes from the word “uni” meaning one. This is the analysis performed on a
single variable and thus does not account for any sort of relationship among exploratory
variables.
Eric decided to perform univariate analysis on the dataset to better understand the
features in isolation (see Listing 1-4).

Listing 1-4. Determining the Time Range of the Dataset


data = data.sort_values(by='starttime')
data.reset_index()
print 'Date range of dataset: %s - %s'%(data.ix[1, 'starttime'],
data.ix[len(data)-1, 'stoptime'])

Output

Date range of dataset: 10/13/2014 10:32 - 9/1/2016 0:20

9
Chapter 1 ■ Statistics and Probability

Eric knew that Nancy would have a hard time understanding the code so he decided
to explain the ones that he felt were complex in nature. In regard to the code in Listing
1-4, Eric explained the following:

We started off by sorting the data frame by starttime. Do note that


data frame is a data structure in Python in which we initially loaded
the data in Listing 1-2. Data frame helps arrange the data in a tabular
form and enables quick searching by means of hash values. Moreover,
data frame comes up with handy functions that make lives easier when
doing analysis on data. So what sorting did was to change the position
of records within the data frame, and hence the change in positions
disturbed the arrangement of the indexes which were earlier in an
ascending order. Hence, considering this, we decided to reset the indexes
so that the ordered data frame now has indexes in an ascending order.
Finally, we printed the date range that started from the first value of
starttime and ended with the last value of stoptime.

Eric’s analysis presented two insights. One is that the data ranges from October 2014
up till September 2016 (i.e., three years of data). Moreover, it seems like the cycle sharing
service is usually operational beyond the standard 9 to 5 business hours.
Nancy believed that short-term pass holders would avail more trips than their
counterparts. She believed that most people would use the service on a daily basis rather
than purchasing the long term membership. Eric thought otherwise; he believed that
new users would be short-term pass holders however once they try out the service and
become satisfied would ultimately avail the membership to receive the perks and benefits
offered. He also believed that people tend to give more weight to services they have paid
for, and they make sure to get the maximum out of each buck spent. Thus, Eric decided
to plot a bar graph of trip frequencies by user type to validate his viewpoint (see Listing 1-5).
But before doing so he made a brief document of the commonly used charts and
situations for which they are a best fit to (see Appendix A for a copy). This chart gave
Nancy his perspective for choosing a bar graph for the current situation.

Listing 1-5. Plotting the Distribution of User Types


groupby_user = data.groupby('usertype').size()
groupby_user.plot.bar(title = 'Distribution of user types')

10
Chapter 1 ■ Statistics and Probability

Distribution of user types


160000

140000

120000

100000

80000

60000

40000

20000

Short-Term Pass Holder


Member

usertype

Figure 1-1. Bar graph signifying the distribution of user types

Nancy didn’t understand the code snippet in Listing 1-5. She was confused by the
functionality of groupby and size methods. She recalled asking Eric the following: “I can
understand that groupby groups the data by a given field, that is, usertype, in the current
situation. But what do we mean by size? Is it the same as count, that is, counts trips falling
within each of the grouped usertypes?”
Eric was surprised by Nancy’s deductions and he deemed them to be correct.
However, the bar graph presented insights (see Figure 1-1) in favor of Eric’s view as the
members tend to avail more trips than their counterparts.
Nancy had recently read an article that talked about the gender gap among
people who prefer riding bicycles. The article mentioned a cycle sharing scheme in UK
where 77% of the people who availed the service were men. She wasn’t sure if similar
phenomenon exists for people using the service in United States. Hence Eric came up
with the code snippet in Listing 1-6 to answer the question at hand.

Listing 1-6. Plotting the Distribution of Gender


groupby_gender = data.groupby('gender').size()
groupby_gender.plot.bar(title = 'Distribution of genders')

11
Chapter 1 ■ Statistics and Probability

Distribution of genders
120000

100000

80000

60000

40000

20000

0
Male

Other
Female

gender

Figure 1-2. Bar graph signifying the distribution of genders

Figure 1-2 revealed that the gender gap resonates in states as well. Males seem to
dominate the trips taken as part of the program.
Nancy, being a marketing guru, was content with the analysis done so far. However
she wanted to know more about her target customers to whom to company’s marketing
message will be targetted to. Thus Eric decided to come up with the distribution of
birth years by writing the code in Listing 1-7. He believed this would help the Nancy
understand the age groups that are most likely to ride a cycle or the ones that are more
prone to avail the service.

Listing 1-7. Plotting the Distribution of Birth Years


data = data.sort_values(by='birthyear')
groupby_birthyear = data.groupby('birthyear').size()
groupby_birthyear.plot.bar(title = 'Distribution of birth years',
figsize = (15,4))

12
Chapter 1 ■ Statistics and Probability

Distribution of birth years


14000

12000

10000
8000
6000

4000

2000

0
1931.0
1936.0
1939.0
1942.0
1943.0
1944.0
1945.0
1946.0
1947.0
1948.0
1949.0
1950.0
1951.0
1952.0
1953.0
1954.0
1955.0
1956.0
1957.0
1958.0
1959.0
1960.0
1961.0
1962.0
1963.0
1964.0
1965.0
1966.0
1967.0
1968.0
1969.0
1970.0
1971.0
1972.0
1973.0
1974.0
1975.0
1976.0
1977.0
1978.0
1979.0
1980.0
1981.0
1982.0
1983.0
1984.0
1985.0
1986.0
1987.0
1988.0
1989.0
1990.0
1991.0
1992.0
1993.0
1994.0
1995.0
1996.0
1997.0
1998.0
1999.0
birthyear

Figure 1-3. Bar graph signifying the distribution of birth years

Figure 1-3 provided a very interesting illustration. Majority of the people who had
subscribed to this program belong to Generation Y (i.e., born in the early 1980s to mid
to late 1990s, also known as millennials). Nancy had recently read the reports published
by Elite Daily and CrowdTwist which said that millennials are the most loyal generation
to their favorite brands. One reason for this is their willingness to share thoughts and
opinions on products/services. These opinions thus form a huge corpus of experiences—
enough information for the millenials to make a conscious decision, a decision they will
remain loyal to for a long period. Hence Nancy was convinced that most millennials
would be members rather than short-term pass holders. Eric decided to populate a bar
graph to see if Nancy’s deduction holds true.

Listing 1-8. Plotting the Frequency of Member Types for Millenials


data_mil = data[(data['birthyear'] >= 1977) & (data['birthyear']<=1994)]
groupby_mil = data_mil.groupby('usertype').size()
groupby_mil.plot.bar(title = 'Distribution of user types')

Distribution of user types


120000

100000

80000

60000

40000

20000

0
Member

usertype

Figure 1-4. Bar graph of member types for millenials


13
Chapter 1 ■ Statistics and Probability

After looking at Figure 1-4 Eric was surprised to see that Nancy’s deduction appeared
to be valid, and Nancy made a note to make sure that the brand engaged millennials as
part of the marketing plan.
Eric knew that more insights can pop up when more than one feature is used as part
of the analysis. Hence, he decided to give Nancy a sneak peek at multivariate analysis
before moving forward with more insights.

Multivariate Analysis
Multivariate analysis refers to incorporation of multiple exploratory variables to
understand the behavior of a response variable. This seems to be the most feasible
and realistic approach considering the fact that entities within this world are usually
interconnected. Thus the variability in response variable might be affected by the
variability in the interconnected exploratory variables.
Nancy believed males would dominate females in terms of the trips completed. The
graph in Figure 1-2, which showed that males had completed far more trips than any
other gender types, made her embrace this viewpoint. Eric thought that the best approach
to validate this viewpoint was a stacked bar graph (i.e., a bar graph for birth year, but each
bar having two colors, one for each gender) (see Figure 1-5).

Listing 1-9. Plotting the Distribution of Birth Years by Gender Type


groupby_birthyear_gender = data.groupby(['birthyear', 'gender'])
['birthyear'].count().unstack('gender').fillna(0)
groupby_birthyear_gender[['Male','Female','Other']].plot.bar(title =
'Distribution of birth years by Gender', stacked=True, figsize = (15,4))

Distribution of birth years by Gender


14000
gender
12000 Male
Female
10000 Other
8000

6000

4000

2000

0
1931.0
1936.0
1939.0
1942.0
1943.0
1944.0
1945.0
1946.0
1947.0
1948.0
1949.0
1950.0
1951.0
1952.0
1953.0
1954.0
1955.0
1956.0
1957.0
1958.0
1959.0
1960.0
1961.0
1962.0
1963.0
1964.0
1965.0
1966.0
1967.0
1968.0
1969.0
1970.0
1971.0
1972.0
1973.0
1974.0
1975.0
1976.0
1977.0
1978.0
1979.0
1980.0
1981.0
1982.0
1983.0
1984.0
1985.0
1986.0
1987.0
1988.0
1989.0
1990.0
1991.0
1992.0
1993.0
1994.0
1995.0
1996.0
1997.0
1998.0
1999.0

birthyear

Figure 1-5. Bar graph signifying the distribution of birth years by gender type

14
Chapter 1 ■ Statistics and Probability

The code snippet in Listing 1-9 brought up some new aspects not previously
highlighted.

We at first transformed the data frame by unstacking, that is, splitting,


the gender column into three columns, that is, Male, Female, and Other.
This meant that for each of the birth years we had the trip count for all
three gender types. Finally, a stacked bar graph was created by using this
transformed data frame.

It seemed as if males were dominating the distribution. It made sense as well. No?
Well, it did; as seen earlier, that majority of the trips were availed by males, hence this
skewed the distribution in favor of males. However, subscribers born in 1947 were all
females. Moreover, those born in 1964 and 1994 were dominated by females as well. Thus
Nancy’s hypothesis and reasoning did hold true.
The analysis in Listing 1-4 had revealed that all millennials are members. Nancy was
curious to see what the distribution of user type was for the other age generations. Is it
that the majority of people in the other age generations were short-term pass holders?
Hence Eric brought a stacked bar graph into the application yet again (see Figure 1-6).

Listing 1-10. Plotting the Distribution of Birth Years by User Types


groupby_birthyear_user = data.groupby(['birthyear', 'usertype'])
['birthyear'].count().unstack('usertype').fillna(0)

groupby_birthyear_user['Member'].plot.bar(title = 'Distribution of birth


years by Usertype', stacked=True, figsize = (15,4))

Distribution of birth years by Usertype


14000
12000
10000
8000
6000
4000
2000
0
1931.0
1936.0
1939.0
1942.0
1943.0
1944.0
1945.0
1946.0
1947.0
1948.0
1949.0
1950.0
1951.0
1952.0
1953.0
1954.0
1955.0
1956.0
1957.0
1958.0
1959.0
1960.0
1961.0
1962.0
1963.0
1964.0
1965.0
1966.0
1967.0
1968.0
1969.0
1970.0
1971.0
1972.0
1973.0
1974.0
1975.0
1976.0
1977.0
1978.0
1979.0
1980.0
1981.0
1982.0
1983.0
1984.0
1985.0
1986.0
1987.0
1988.0
1989.0
1990.0
1991.0
1992.0
1993.0
1994.0
1995.0
1996.0
1997.0
1998.0
1999.0

birthyear

Figure 1-6. Bar graph signifying the distribution of birth years by user types

15
Another random document with
no related content on Scribd:
Between these two creatures there arose a quarrel, which
terminated in a fight. The toad in vain tried to swallow its antagonist,
but the latter rushed upon it, and with his horn pierced a hole in its
side, out of which the water gushed in floods, and soon overflowed
the face of the earth. At this time Nanahbozhoo was living on the
earth, and observing the water rising higher and higher, he fled to the
loftiest mountain for refuge. Perceiving that even this retreat would
be soon inundated, he selected a large cedar tree which he
purposed to ascend, should the waters come up to him. Before they
reached him he caught a number of animals and fowls, and put them
into his bosom. At length the water covered the mountain.
Nanahbozhoo then ascended the cedar tree, and as he went up he
plucked its branches and stuck them in the belt which girdled his
waist. When he reached the top of the tree he sang, and beat the
tune with his arrow upon his bow, and as he sang the tree grew and
kept pace with the water for a long time. At length he abandoned the
idea of remaining any longer on the tree, and took the branches he
had plucked, and with them constructed a raft, on which he placed
himself with the animals and fowls. On this raft he floated about for a
long time, till all the mountains were covered, and all the beasts of
the earth and fowls of the air, except those he had with him,
perished.
“At length Nanahbozhoo thought of forming a new world, but how
to accomplish it without any materials he knew not, till the idea
occurred to him that if he could only obtain a little of the earth, which
was then under water, he might succeed in making a new world out
of the old one. He accordingly employed the different animals he had
with him that were accustomed to diving. First, he sent the loon, a
water fowl of the penguin species, down into the water in order to
bring up some of the old earth; but it was not able to reach the
bottom, and after remaining in the water some time, came up dead.
Nanahbozhoo then took it, blew upon it, and it came to life again. He
next sent the otter, which also failing to reach the bottom, came up
dead, and was restored to life in the same manner as the loon. He
then tried the skill of the heaver, but without success. Having failed
with all these diving animals, he last of all took the musk-rat; on
account of the distance it had to go to reach the bottom, it was gone
a long time, and came up dead. On taking it up, Nanahbozhoo
found, to his great joy, that it had reached the earth, and had
retained some of the soil in each of its paws and mouth. He then
blew upon it, and brought it to life again, at the same time
pronouncing many blessings on it, saying, that as long as the world
he was about to make should endure, the musk-rat should never
become extinct. This prediction of Nanahbozhoo is still spoken of by
the Indians when referring to the rapid increase of the musk-rat.
Nanahbozhoo then took the earth which he found in the musk-rat’s
paws and mouth, and having rubbed it with his hands to fine dust, he
placed it on the waters and blew upon it; then it began to grow larger
and larger, until it was beyond the reach of his eye. In order to
ascertain the size of the world, and the progress of its growth and
expansion, he sent a wolf to run to the end of it, measuring its extent
by the time consumed in his journey. The first journey he performed
in one day, the second took him five days, the third ten, the fourth a
month, then a year, five years, and so on, until the world was so
large that Nanahbozhoo sent a young wolf that could just run, which
died of old age before he could accomplish the journey.
Nanahbozhoo then said the world was large enough, and
commanded it to cease from growing. After this Nanahbozhoo took a
journey to view the new world he had made, and as he travelled he
created various tribes of Indians, and placed them in different parts
of the earth; he then gave them various religions, customs, and
manners.
“This Nanahbozhoo now sits at the North Pole, overlooking all the
transactions and affairs of the people he has placed on the earth.
The Northern tribes say that Nanahbozhoo always sleeps during the
winter; but, previous to his falling asleep, fills his great pipe, and
smokes for several days, and that it is the smoke arising from the
mouth and pipe of Nanahbozhoo which produces what is called
‘Indian summer.’”
They have, however, legends that relate to times anterior to the
flood, even to the beginning of Time itself and the days of Adam and
Eve. Mr. Kohl, of “Lake Superior” celebrity, contributes the following:
“On Torch Lake it is said, that Kitchi-Manitou (the Good Spirit) first
made the coast of our lake. He strewed the sand and formed a fine
flat dry beach or road round the lake. He found that it was splendid
walking upon it, and often wandered along the beach. One day he
saw something lying on the white sand. He picked it up. It was a very
little root. He wondered whether it would grow if planted in the
ground, and made the trial. He planted it close to the edge of the
water in the sand, and when he came again, the next day, a thick
and large reed-bed had grown out of it through which the wind
rustled. This pleased him, and he sought for and collected more little
roots and other seeds from the sand and spread them around so that
they soon covered the rocks and land with grass and fine forests, in
which the birds and other animals came to live. Every day he added
something new to the creation, and did not forget to place fish and
other creatures in the water.
“One day when Kitchi-Manitou was again walking along the sand,
he saw something moving in the reeds, and noticed a being coming
out of the water entirely covered with silver-glistening scales like a
fish, but otherwise formed like a man. Kitchi-Manitou was curious to
see on what the being lived and whether it ate herbs, especially as
he saw it constantly stooping and plucking herbs which it swallowed.
The man could not speak, but at times when he stooped he sighed
and groaned.
“The sight moved Kitchi-Manitou with compassion in the highest
degree, and as a good thought occurred to him, he immediately
stepped into his canoe and paddled across to the island, which still
lies in the centre of the lake. Here he set to work providing the man
the company of a squaw. He formed her nearly like what he had
seen the man to be, and also covered her body with silver-glistening
scales. Then he breathed life into her, and carried her across in his
canoe to the other bank of the lake, telling her that if she wandered
busily along the lake and looked about her, she would perhaps find
something to please her. For days the squaw wandered about one
shore of the lake, while the man was seeking herbs for food on the
other. One day the latter went a little further, and, to his great
surprise, saw footsteps in the sand much like those he himself made.
At once he gave up seeking herbs and followed these footsteps, as
he hoped there were other beings like himself on the lake. The
squaw during her long search had left so many footsteps that the
man at first feared they might belong to a number of Indians, and
they might perhaps be hostile. Hence he crept along carefully in the
bush, but always kept an eye on the trail in the sand.
“At last he found the being he sought sitting on a log near the
shore. Through great fatigue she had fallen asleep. He looked
around to the right and left but she was quite alone. At length he
ventured to come out of the bushes; he approached her with
uncertain and hesitating steps; he seized her and she opened her
eyes.
“‘Who art thou?’ he said, for he could now suddenly speak, ‘Who
art thou, what is thy name, and whither dost thou come?’
“‘My name is Mami,’ she replied, ‘and Kitchi-Manitou brought me
here from that island, and told me I should find something here I
liked. I think that thou art the promised one.’
“‘On what dost thou live?’ the man asked the woman.
“‘Up to this time I have eaten nothing, for I was looking for thee.
But now I feel very hungry; hast thou anything to eat?’
“Straightway the man ran into the bushes, and collected some
roots and herbs he had found good to eat, and brought them to the
squaw, who greedily devoured them.
“The sight of this moved Kitchi-Manitou, who had watched the
whole scene from his lodge. He immediately came over in his canoe,
and invited the couple to his island. Here they found a handsome
large house prepared for them, and a splendid garden round it. In
the house were glass windows, and in the rooms tables and chairs
and beds and conveniences of every description. In the garden grew
every possible sort of useful and nourishing fruits, potatoes,
strawberries, apple-trees, cherry and plum trees; and close by were
large fine fields planted with Indian corn and beans.
“They ate and lived there for days and years in pleasure and
happiness; and Kitchi-Manitou often came to them and conversed
with them. ‘One thing,’ he said, ‘I must warn you against. Come
hither; see, this tree in the middle of the garden is not good. I did not
plant it, but Matchi-Manitou planted it. In a short time this tree will
blossom and bear fruits which look very fine and taste very sweet;
but do not eat of them, for if ye do so ye will die.’ They paid attention
to this, and kept the command a long time, even when the tree had
blossomed and the fruit had set. One day, however, when Mami
went walking in the garden, she heard a very friendly and sweet
voice say to her, ‘Mami, Mami, why dost thou not eat of this beautiful
fruit? it tastes splendidly.’ She saw no one, but she was certain the
voice did not come either from Kitchi-Manitou or her husband. She
was afraid and went into the house. The next day though, she again
went into the garden, and was rather curious whether the same
pleasant voice would speak to her again. She had hardly
approached the forbidden tree, when the voice was heard once
more, ‘Mami, Mami, why dost thou not taste this splendid fruit? it will
make thy heart glad.’ And with these words a young handsome
Indian came out of the bushes, plucked a fruit, and placed it in her
hand. ‘Thou canst make famous preserves of it for thy household,’
the friendly Indian added.
“The fruit smelled pleasantly, and Mami licked it a little. At length
she swallowed it entirely, and felt as if drunk. When her husband
came to her soon after she persuaded him also to eat of it; he did so,
and also felt as if drunk. But this had scarce happened ere the silver
scales with which their bodies had been covered, fell off; only twenty
of these scales remained on, but they had lost their brilliancy,—ten
on the fingers and ten on the toes. They saw themselves to be quite
uncovered, and began to be ashamed, and withdrew timidly into the
bushes of the garden.
“The young Indian had disappeared, but the angry Kitchi-Manitou
soon came to them, and said ‘It is done; ye have eaten of Matchi-
Manitou’s fruit, and must now die. Hence it is necessary that I should
marry you, lest the whole human race might die out with you. Ye
must perish, but shall live on in your children and children’s children.’
Kitchi-Manitou banished them also from the happy isle, which
immediately grew wild, and bore them in his canoe to the shores of
the lake. But he had mercy on them still. He gave the man a bow
and arrow, and told him he would find animals which were called
deer. These he was to shoot, and Mami would get ready the meat for
him, and make mocassins and clothing of the hide.
“When they reached the other shore, Mami’s husband tried first of
all this bow and the arrows. He shot into the sand, and the arrows
went three inches deep into the ground.
“Mami’s husband then went for the first time to hunt, and saw in
the reeds on the lake an animal moving, which he recognised for a
deer, as Kitchi-Manitou had described it to him. He shot his arrow,
and the animal straightway leaped from the water on shore, sank on
its knees, and died. He ran up and drew his arrow from the wound,
examined it, found that it was quite uninjured, and placed it again in
his quiver, as he thought he could use it again. When he brought the
deer to his squaw, she cut it into pieces, washed it, and laid the hide
aside for shoes and clothing; but soon saw that they, as Indians,
could not possibly eat the meat raw, as the barbarous Eskimos in the
north do: she must cook it, and for that purpose have fire.
“This demand embarrassed the man for a moment, as he had
never yet seen any meat boiling or roasting before the fire. But he
soon knew how to help himself. He took two different descriptions of
wood, rubbed them against each other, and soon made a bright fire
for his squaw. The squaw in the meanwhile had prepared a piece of
wood as a spit, placed a lump of meat on it, and held it in the fire.
They both tasted it, and found it excellent. ‘As this is so good, the
rest will be famous,’ she said, and cut it all up and put it in the kettle,
and then they ate nearly all the deer that same evening. This gave
Mami’s husband strength and courage, and he went out hunting
again the next morning, and shot a deer; and so he did every day,
while his squaw built a lodge for him, and sewed clothes and
mocassins.
“One day when he went a-hunting again, the man found a book
lying under a tree. He stopped and looked at it. The book began
speaking to him, and told him what he was to do, and what to leave
undone. It gave him a whole series of orders and prohibitions. He
found this curious, and did not much like it; but he took it home to his
squaw.
“‘I found this book under a tree,’ he said to her, ‘which tells me to
do all sorts of things, and forbids me doing others; I find this hard,
and I will carry it back to where I found it.’ And this he did too,
although his squaw begged him to keep it. ‘No,’ he said, ‘it is too
thick; how could I drag it about with me in my medicine bag?’ And he
laid the book again, the next day, under the tree, where he had taken
it up; and so soon as he laid it down, it disappeared. The earth
swallowed it up.
“Instead of it, however, another book appeared in the grass. That
was easy and light, and only written on a couple of pieces of birch
bark. It also spoke to him in the clear and pure Ojibbeway language;
forbade him nothing, and ordered him nothing; and only taught him
the use and advantages of the plants in the forest and on the prairie.
This pleased him much, and he put the book at once in his hunting
bag, and went into the forest, and collected all the plants, roots,
flowers, and herbs which it pointed out to him.
“Quite loaded with herbs of fifty different sorts, he returned to his
squaw Mami. He sorted them out, and found they were all medicine,
good in every accident of life. As he had in this way become a great
medicine man, as well as a mighty hunter, he wanted but little more
to satisfy his earthly wants. The children his wife bore him he
brought up as good hunters; taught them the use of the bow;
explained to them the medicine book; and told them, shortly before
his and Mami’s death, the history of their creation and their former
mode of life on the Torch Lake island with Kitchi-Manitou, who now,
after so much suffering and sorrow, was graciously pleased to
receive them again.”
The following story was communicated to Mr. Jones, a native
minister, by an Ojibbeway Indian named Netahgawineneh, and will
serve to illustrate the source whence they derive their ideas of a
future state:—
“In the Indian country far west an Indian once fell into a trance,
and when he came to life again, he gave the following account of his
journey to the world of spirits.
“I started, said he, my soul or spirit in company with a number of
Indians who were travelling to the same spirit land. We directed our
footsteps towards the sun-setting. On our journey we passed
through a beautiful country, and on each side of our trail saw
strawberries as large as a man’s head. We ate some of them, and
found them very sweet; but one of our party who kept loitering
behind, came up to us and demanded, ‘Why were we eating a ball of
fire?’ We tried to persuade him to the contrary, but the foolish fellow
would not listen to our words, and so went on his way hungry. We
travelled on until we came to a dark, swollen and rapid river, over
which was laid a log vibrating in a constant wavering motion. On this
log we ventured to cross, and having arrived at the further end of it,
we found that it did not reach the shore; this obliged us to spring with
all our might to the land. As soon as we had done this, we perceived
that the supposed log on which we had crossed was a large serpent,
waving and playing with his huge body over the river. The foolish
man behind was tossed about until he fell off, but he at length
succeeded in swimming to shore. No sooner was he on land than a
fierce and famished pack of wolves fell on him and began to tear him
to pieces, and we saw him no more. We journeyed on, and by and
by came within sight of the town of spirits. As soon as we made our
appearance there was a great shout heard, and all our relatives ran
to meet us and to welcome us to their happy country. My mother
made a feast for me, and prepared everything that was pleasant to
eat and to look upon; here we saw all our forefathers; and game and
corn in abundance; all were happy and contented.
“After staying a short time, the Great Spirit of the place told me
that I must go back to the country I had left, as the time had not yet
arrived for me to dwell there. I accordingly made ready to return; and
as I was leaving, my mother reproached me by all manner of foolish
names for wishing to leave so lovely and beautiful a place. I took my
departure, and soon found myself in the body and in the world I had
left.”
The allegorical traditions of the North American Indians regarding
the introduction into the world of the art of medicine and of religious
mysteries are still more extravagant than their theogony. We will cite
from Dominech the principal among them, to give an idea of all the
others of the same kind.
“A great Manitou of heaven came once on earth and married a
woman, who died, after giving birth to four children. The first was
called Manabozho, and was the protector and friend of men; the
second Chibiabos, took care of the dead and ruled over the empire
of shadows, that is to say, of souls; the third, called Onabasso, fled
towards the north as soon as he saw the day, and was
metamorphosed into a white rabbit without ceasing to be a Manitou;
the last of the four brothers was called Chokanipok, that is to say, the
man of the fire-stone.
“When Manabozho grew up, he declared war against Chokanipok,
whom he accused of being the cause of their mother’s death. The
struggle was long and terrible. The surface of the earth still
preserves traces of the battles which were fought between them.
Chokanipok was conquered by his brother, his entrails were taken
out, and changed into vines, and the fragments of his body became
fire-stones, which were scattered all over the globe, and supplied
man with the principle of fire. Manabozho it was who taught the Red
Indians the mode of manufacturing axe blades, arrow points, traps,
nets, how to turn stones and bones to use to capture wild animals,
fish, and birds. He was very much attached to Chibiabos, with whom
he lived in the desert, where they conferred together for the good of
humanity. The material power and the extraordinary intelligence of
these two superior beings excited the jealousy of the Manitous, who
lived in the air, on earth, and in the water. This jealousy gave rise to
a conspiracy against the life of Chibiabos. Manabozho warned him to
be on his guard against the machinations of the Manitous, and never
to quit him. But one day Chibiabos ventured alone during the winter
on one of the great frozen lakes; when he arrived in the middle of the
lake the Manitous broke the ice, and Chibiabos sank to the bottom of
the water, where his body remained buried.
“Manabozho wandered for a long time on the banks of the lake,
calling his beloved brother; his voice trembling with fear and hope,
was heard from afar. When he had no longer any doubt of the
misfortune which had befallen him, his fury knew no bounds; he
declared war against the wicked Manitous, killed a great number of
them, and his rage no less than his despair spread consternation
through the whole desert. After the first moments devoted to
revenge, he painted his face black, covered his head with a veil of
the same colour, then sat down on the shore of the lake and
mourned the deceased for six years, making the neighbouring
echoes incessantly ring with the cherished name of Chibiabos. The
Manitous deeply moved by his profound grief, assembled to consult
on the means they should take to console the unhappy mourner. The
oldest and wisest of them all, who had not been concerned in the
death of Chibiabos, took the task of reconciliation on himself. Aided
by the other spirits, he built a sacred lodge near that of Manabozho,
and prepared a great feast. He procured the best tobacco
imaginable, and put it in a beautiful calumet; then placing himself at
the head of the Manitous, who walked in procession, each carrying
under his arm a bag made of the skins of various animals, and filled
with precious medicine, he went to invite Manabozho to the festival.
Manabozho uncovered his head, washed his face, and followed the
Manitous to the sacred lodge. On his entrance he was offered a
drink composed of the most exquisite medicines, a rite initiatory to
propitiation. Manabozho drank it in a single draught, and immediately
felt the grief and sadness lifted from his soul. The Manitous then
began their dances and songs, which were succeeded by several
ceremonies and by feats of address and magic, performed with the
intention of restoring serenity of mind to the unconsolable protector
and friend of the human race. It was thus the mysteries of the dance
and of medicine were introduced on the earth.
“The Manitous then united all their powers to recall Chibiabos to
life, which they did without difficulty. He was, however, forbidden to
enter the sacred lodge; but receiving a flaming brand, he was sent to
preside over the empire of the dead. Manabozho, quite consoled,
ate, drank, danced, and smoked the sacred pipe, went away to the
Great Spirit, and returned to earth to instruct men in the useful arts,
in the mysteries of dancing and medicine, and in the curative
properties of plants. It is he who causes the medicinal plants to grow
which cure sickness and wounds; it is he who killed all the monsters
with which the desert was peopled. He placed spirits at the four
cardinal points to protect the human race: that of the north sends
snow and ice to facilitate the chase in winter; that of the south
causes the maize to grow, as well as all kinds of fruit and tobacco;
that of the west gives rain; and that of the east brings light, by
commanding the sun to move round the globe. Thunder is the voice
of these four spirits, to whom tobacco is offered in thanksgiving for
the various blessings which they confer on the inhabitants of the
earth.”
Among the more ignorant tribes of North American Indians the
god of thunder is believed to be the eagle. The Rev. Peter Jones
asserts this to be the belief of the Ojibbeways. When a thunderbolt
strikes a tree or the ground, they fancy that the thunder has shot his
fiery arrow at a serpent and caught it away in the twinkling of an eye.
Some Indians affirm that they have seen the serpent taken up by the
thunder into the clouds. They believe that the thunder has its abode
on the top of a high mountain in the west, where it lays its eggs and
hatches its young, like an eagle, and whence it takes its flight into
different parts of the earth in search of serpents.
The following is a story related by an Indian who is said to have
ventured, at the risk of his life, to visit the abode of the thunders:
“After fasting, and offering my devotions to the thunder, I with much
difficulty ascended the mountain, the top of which reached to the
clouds. To my great astonishment, as I looked I saw the thunder’s
nest, where a brood of young thunders had been hatched and
reared. I saw all sorts of curious bones of serpents, on the flesh of
which the old thunders had been feeding their young; and the bark of
the young cedar trees peeled and stripped, on which the young
thunders had been trying their skill in shooting their arrows before
going abroad to hunt serpents.”
Another thunder tradition says: “That a party of Indians were once
travelling on an extensive plain, when they came upon two young
thunders lying in their nest in their downy feathers, the old thunders
being absent at the time. Some of the party took their arrows, and
with the point touched the eyes of the young thunders. The moment
they did so their arrows were shivered to pieces, as if a young
thunder arrow had struck them. One of the party, more wise than his
companions, entreated them not to meddle with them, warning them
that if they did they would pay dearly for their folly. The foolish young
men would not listen, but continued to teaze and finally killed them.
As soon as they had done this a black cloud appeared, advancing
towards them with great fury. Presently the thunder began to roar
and send forth volumes of its fiery indignation. It was too evident that
the old thunders were enraged on account of the destruction of their
young—soon, with a tremendous crash, the arrows of the mighty
thunder-god fell on the foolish men and destroyed them, but the wise
and good Indian escaped unhurt.”
In proof of the American Indian’s suspicious nature, especially as
regards matters connected with a religion differing from his own, Dr.
Franklin furnishes the following little story:—
“Conrad Weiser, the Indian interpreter, who had gone to
Ouondago with a message from Government, demanded hospitality
of one of his old friends, the famous Canastatego, one of the chiefs
of the six nations. Happy to meet after a long separation, the two
friends were joyous and chatty. Conrad was soon seated on furs
spread on the ground, with a meal of boiled vegetables, venison, and
rum and water before him. After dinner Canastatego asked how the
years since they had parted had passed with his friend, whence he
came, where going, and what the aim of his journey. When all these
questions were answered, the old Indian said, ‘Conrad, you have
lived a great deal among white people, and know their customs. I
have myself been several times to Albany, and have observed that
once every seven days they shut up their shops and assemble in a
large house; tell me wherefore, and what they do there?’—‘They
assemble to hear and learn good things,’ replied Conrad.—‘I have no
doubt,’ said the Indian, ‘that they have told you that; but I do not
much believe in their words, and I will tell you why. Some time ago I
went to Albany to sell furs and to buy blankets, powder, and knives.
You know I am in the habit of dealing with Hans Hanson, but on that
day I had a mind to try another merchant, but first went to Hans
Hanson and asked what he would give for beaver skins. He
answered that he could not pay a higher price than four shillings a
pound. “But,” added he, “I cannot talk of such affairs to-day; it is the
day of our meeting to hear good things, and I am going to the
assembly.” I then reflected that as there was no possibility of my
transacting business on that day, I too might as well go to the great
house and hear good things.
“‘There was a man in black who seemed in a great passion while
speaking to the people. I did not understand what he said, but
perceiving that he looked a good deal at me, I thought that perhaps
he was angry at seeing me in the house. I therefore hastened to
leave it, and went and seated myself outside on the ground against
the wall, and began to smoke till the end of the ceremony. I fancied
that the man in black had spoken about beavers, and I suspected
that that was the motive of the meeting, so that as the crowd was
coming out, I stopped my merchant, and said to him, “Well, Hans
Hanson, I hope you will give me more than four shillings a
pound.”—“No,” answered he, “I can only give you three shillings and
a half.”
“‘I then spoke to other merchants, but all were unanimous in the
price. This proved clearly that I was right in my suspicions, and that
the pretended intention of meeting to hear good things was only
given out to mislead opinions, and that the real aim of the meeting
was to come to an understanding to cheat the Indians as to the price
of their goods. Reflect, Conrad, and you will see that I have guessed
the truth; for if white people meet so often to hear good things, they
would have finished by knowing some long since, but on that head
they are still very ignorant. You know our ways when white men
travel over our lands and enter our colonies: we treat them as I treat
you; when wet we dry them, we warm them when they are cold, we
give them food and drink, and spread our best furs for them to
repose on, asking for nothing in return. But if I go to a white man and
ask for eat and drink, he answers me, “Begone, Indian dog!” You
thus see that they have as yet learned very few good things, which
we know, because our mothers taught them to us when we were little
children, and that the subject of all these assemblies is to cheat us in
the price of our beavers.’”
Here is a strange story of North American Indian “second sight”
and not the less remarkable as it is recorded by a highly respectable
Wesleyan Missionary who had it from a Government Indian Agent in
Upper Canada.
“In the year 1804, wintering with the Winebagos on the Rock river,
I had occasion to send three of my men to another wintering house,
for some flour which I had left there in the fall on my way up the river.
The distance being about one and a half day’s journey from where I
lived, they were expected to return in about three days. On the sixth
day after their absence I was about sending in quest of them, when
some Indians, arriving from the spot, said that they had seen nothing
of them. I could now use no means to ascertain where they were: the
plains were extensive, the paths numerous, and the tracks they had
made were the next moment covered by the drift snow. Patience was
my only resource; and at length I gave them up for lost.
“On the fourteenth night after their departure, as several Indians
were smoking their pipes, and telling stories of their war parties,
huntings, etc., an old fellow, who was a daily visitor, came in. My
interpreter, a Canadian named Felix, pressed me, as he had
frequently done before, to employ this conjuror, as he could inform
me about the men in question. The dread of being laughed at had
hitherto prevented my acceding to his importunities; but now, excited
by curiosity, I gave the old man a quarter-pound of tobacco and two
yards of ribbon, telling him that if he gave me a true account of the
missing ones, I would, when I ascertained the fact, give him a bottle
of rum. The night was exceedingly dark and the house situated on a
point of land in a thick wood. The old fellow withdrew, and the other
Indians retired to their lodges.
“A few minutes after, I heard Wahwun (an egg) begin a
lamentable song, his voice increasing to such a degree that I really
thought he would have injured himself. The whole forest appeared to
be in agitation, as if the trees were knocking against each other; then
all would be silent for a few seconds; again the old fellow would
scream and yell, as if he were in great distress. A chill seized me,
and my hair stood on end; the interpreter and I stared at each other
without power to express our feelings. After remaining in this
situation a few minutes the noise ceased, and we distinctly heard the
old chap singing a lively air. We expected him in, but he did not
come. After waiting some time, and all appearing tranquil in the
woods, we went to bed. The next morning I sent for my friend
Wahwun to inform me of his jaunt to see the men.
“‘I went,’ said he, ‘to smoke the pipe with your men last night, and
found them cooking some elk meat, which they got from an Ottawa
Indian. On leaving this place they took the wrong road on the top of
the hill; they travelled hard on, and did not know for two days that
they were lost. When they discovered their situation they were much
alarmed, and, having nothing more to eat, were afraid they would
starve to death. They walked on without knowing which way they
were going until the seventh day, when they were met near the
Illinois river by the Ottawa before named, who was out hunting. He
took them to his lodge, fed them well, and wanted to detain them
some days until they had recovered their strength; but they would
not stay. He then gave them some elk meat for their journey home,
and sent his son to put them into the right road. They will go to
Lagothenes for the flour you sent them, and will be at home in three
days.’ I then asked him what kind of place they were encamped in
when he was there? He said, ‘they had made a shelter by the side of
a large oak tree that had been torn up by the roots, and which had
fallen with the head towards the rising sun.’
“All this I noted down, and from the circumstantial manner in
which he related every particular, though he could not possibly have
had any personal communication with or from them by any other
Indians, I began to hope my men were safe, and that I should again
see them. On the appointed day the interpreter and myself watched
most anxiously, but without effect. We got our suppers, gave up all
hopes, and heartily abused Wahwun for deceiving us. Just as we
were preparing for bed, to my great joy the men rapped at the door,
and in they came with the flour on their backs. My first business was
to enquire of their travels. They told me the whole exactly as the old
Indian had before stated, not omitting the tree or any other
occurrence; and I could have no doubt but that the old fellow had got
his information from some evil or familiar spirit.”
As has already been mentioned in this book, belief in dreams is
very intimately associated with North-American Indian religious
belief; and when an Indian dreams anything that seems to him
important, he does not fail to enter in his birch bark “note book” the
most salient points of it. Being, as a rule, however, incapable of
giving his thoughts a tangible appearance by the ordinary caligraphic
process, he draws the pictures just as he sees them in his vision.
From the birch bark of a brave, by name the “Little Wasp,” Mr. Kohl
copied the picture which appears on the next page: and this is the
explanation of it:—
“The dreamer lying on his bed of moss and grass is dreaming the
dream of a true hunter, and there are the heads of the birds and
beasts which his guardian spirit promises that he shall not chase in
vain. The man wearing the hat is a Frenchman, which the Little
Wasp also dreams about.
“The Indians picture themselves without a hat because they
usually have no other head gear than their matted hair, or, at most, a
cloth wound turban-wise round the head. The hat, however, appears
to them such a material part of a European—as much a part of their
heads as the horse to the Centaur—that a hat in a picture-writing
always indicates a European.
“It was not at all stupid of Little Wasp to dream of a Frenchman,
for of what use would a sky full of animals prove to him unless he
had a good honest French traiteur to whom he could sell the skins
and receive in exchange fine European wares? The vault of the sky
is represented by several semi-circular lines in the same way as it is
usually drawn on their gravestones. On some occasions I saw the
strata or lines variously coloured—blue, red, and yellow, like the
hues of the rainbow. Perhaps, too, they may wish to represent that
phenomenon as well. But that the whole is intended for the sky is
proved by the fact that the ordinary colour is a plain blue or grey. The
bird soaring in the heavens was meant for the kimou which so often
appears in the dreams of these warlike hunters.
“When I asked the dreamer what he meant by the strokes and
figures at the foot of the drawing, he said: ‘It is a notice that I fasted
nine days on account of this dream. The nine strokes indicate the
number nine, and a small figure of the sun over them means days.’
“His own self he indicated by the human figure. It has no head but
an enormous heart in the centre of the breast.
“Though the head is frequently missing, the heart is never omitted
in Indian figures, because they have as a general rule, more heart
than brains, more courage than sense. ‘I purposely made the heart
rather large,’ the author of the picture remarked, ‘in order to show
that I had so much courage as to endure a nine days’ fast.’ He
omitted the head, probably because he felt that sense was but little
mixed up with such nonsensical fasting.
“‘But why hast thou painted the sun once more, and with so much
care over it?’ asked I. ‘Because,’ replied he, ‘the very next morning
after my fast was at an end, the sun rose with extraordinary
splendour, which I shall never forget, for a fine sunrise after a dream
is the best sign that it will come to pass.’”
The superstitions, in fact, of all Indians, are singularly wild, poetic,
and primitive. Catlin, in his “Descriptive Catalogue,” gives some
strange and interesting particulars. He says, for instance, the Sioux
have a superstitious belief that they will conquer their enemy if they
go through the following ceremony:—A dog’s liver and heart are
taken raw and bleeding and placed upon a sort of platform, and,
being cut into slips, each man dances upon it, bites off and swallows
a piece of it, in the certain belief that he has thus swallowed a piece
of the heart of his enemy whom he has slain in battle. Again, it is
supposed that he most is in the favour of the Great Spirit who can
throw most arrows from an Indian bow before the first cast reaches
the ground, and Catlin says: “So eager are the Indians for this
supremacy that I have known men who could get eight arrows in the
air, all moving at the same time.” Another superstition takes the
shape of a belief in dancing compelling a flock of buffaloes to turn
upon the path of the dancers. This superstitious gyration is only
resorted to when a tribe is absolutely starving, and it is accompanied
by a song to the Great Spirit, imploring Him to help them, promising,
at the same time, a burnt sacrifice, or, as they themselves generally
put it, that the Great Spirit shall have the best of the meat cooked for
himself.
A far more charming use of the superstitious, or rather religious,
dances is that of the warriors upon their return from battle, when, if
they can exhibit scalps, they are justified in dancing and wailing in
front of the wigwams of the widows of their companions who have
been killed. If the widow is one of a man of any importance in the
tribe, especially if he has been a medicine man, they cast presents
upon the ground for the use of the widowed woman.
Another strange superstition is the green corn dance—the
sacrifice of the first kettle to the Great Spirit. Four medicine men,
whose bodies are painted with white clay, dance around the kettle
until the corn is well boiled, and they then burn it to cinders as an
offering to the Great Spirit. The fire is then destroyed, and new fire
created by rubbing two sticks together, with which the corn for their
own feast is cooked.
Again, there is a snow-shoe dance, performed at the first fall of
snow, and which is as solemn a rite as any in the Indian faith.
Another strange superstition is that by which an Indian becomes a
medicine or mystery man. Splints of wood are thrust through his
flesh and by these he hangs from a pole, and gazes, medicine bag in
hand, at the sun, from its rising to its setting. This voluntary torture
entitles him to great respect for the remainder of his life as a
medicine or mystery man—in another word, an astrologer. The
history of Indian superstition has yet to be written.
The North American is no less adept at picture “talking” than at
picture writing. Burton, while sojourning among the Prairie Indians,
devoted considerable attention to this art as practised among them.
He describes it as a system of signs, some conventional, others
instinctive or imitative, which enables tribes who have no
acquaintance with each other’s customs and tongues to hold limited
but sufficient communication, An interpreter who knows all the signs,
which, however, are so numerous and complicated that to acquire
them is the labour of years, is preferred by the whites even to a good
speaker. The sign system doubtless arose from the necessity of a
communicating medium between races speaking many different
dialects and debarred by circumstances from social intercourse.
The first lesson is to distinguish the signs of the different tribes,
and it will be observed that the French voyageurs and traders have
often named the Indian nations from their totemic or masonic
gestures.
The Pawnees imitate a wolf’s ears with the two forefingers—the
right hand is always understood unless otherwise specified—
extended together, upright, on the left side of the head.
The Araphos, or Dirty Noses, rub the right side of that organ with
the forefingers; some call this bad tribe the Smellers, and make their
sign to consist of seizing the nose with the thumb and forefinger.
The Comanches imitate by the waving of the hand or forefinger
the forward crawling motion of a snake.
The Cheyennes, Piakanoves, or Cut Wrists, draw the lower edge
of the hand across the left arm, as if gashing it with a knife.
The Sioux, by drawing the lower edge of the hand across the
throat; it is a gesture not unknown to us, but forms a truly ominous
salutation, considering those by whom it is practised; hence the
Sioux are called by the Yutas Hand-cutters.
The Hapsaroke, by imitating the flapping of the bird’s wings with
the two hands, palms downwards, brought close to the shoulders.
The Kiowas, or Prairie-men, make the signs of the prairie, and of
drinking water.
The Yutas, they who live on mountains, have a complicated sign
which denotes “living in mountains.”
The Black-feet, called by the Yutas Paike or Goers, pass the right
hand, bent spoon-fashion, from the heel to the little toe of the right
foot.
The following are a few preliminaries indispensable to the prairie
traveller:
Halt! Raise the hand, with the palm in front, and push it backward
and forward several times, a gesture well known in the East.
I don’t know you. Move the raised hand, with the palm in front,
slowly to the right and left.
I am angry. Close the fist, place it against the forehead, and turn it
to and fro in that position.
Are you friendly? Raise both hands, grasped as if in the act of
shaking hands, or lock the two forefingers together, while the hands
are raised.
See. Strike out the two forefingers forward from the eyes.
Smell. Touch the nose-tip. A bad smell is expressed by the same
sign, ejaculating at the same time, “pooh,” and making the sign of
bad.
Taste. Touch the tongue-tip.
Eat. Imitate the actions of conveying food with the fingers to the
mouth.
Drink. Scoop up with the hand imaginary water into the mouth.
Smoke. With the crooked index describe a pipe in the air,
beginning at the lips, then wave the open hand from the mouth to
imitate curls of smoke.
Speak. Extend the open hand from the chin.
Fight. Make a motion with both fists to and fro like a pugilist of the
eighteenth century who preferred a high guard.
Kill. Smite the sinister palm earthwards, with the dexter fist
sharply, the sign of going down, or strike out with the dexter fist
towards the ground, meaning to “shut down,” or pass the dexter
index under the left forefinger, meaning to “go under.”
Some of the symbols of relationship are highly appropriate and
not ungraceful or unpicturesque. Man is denoted by a sign which will
not admit of description; woman by passing the hand down both
sides of the head, as if smoothing or stroking the long hair. For a

You might also like