Corpus Linguistics
Lecture 5
Corpus linguistics
• Originated in the 1950s
• Corpus: a large, principled collection of natural texts
• Language has been collected from naturally occurring sources rather than from
surveys or questionnaires
• In the case of spoken language, this means first recording and then transcribing the
speech.
• There are a number of existing corpora: the British National Corpus (BNC), the
Corpus of Contemporary American English (COCA), the Brown Corpus, the
Lancaster/Oslo–Bergen (LOB) Corpus and the Helsinki Corpus of English Texts.
Because corpus linguistics uses large collections of naturally occurring language,
the use of computers for analysis is imperative.
Types of corpora
➣ Web as a corpus
General Corpora
1. What we can get from a corpus is frequency of occurrence information.
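As a rough illustration of frequency of occurrence information, the sketch below counts word forms in a short text. The sample sentence and the function name word_frequencies are made up for this example, and the tokenisation (lowercasing, then keeping runs of letters and apostrophes) is a simplifying assumption, not how any particular corpus tool works.

```python
import re
from collections import Counter

def word_frequencies(text):
    """Count how often each word form occurs in a text (case-insensitive)."""
    # Assumption: a token is any run of letters or apostrophes after lowercasing.
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)

# Hypothetical two-sentence "corpus", for illustration only.
sample = "The cat sat on the mat. The dog sat too."
freqs = word_frequencies(sample)

# Show the two most frequent word forms with their raw counts.
for word, count in freqs.most_common(2):
    print(word, count)
```

On a real corpus the raw counts are usually normalised (for example, per million words) so that corpora of different sizes can be compared.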
What can a corpus tell us?
Providing a basis for deciding which language features and structures are important
and also how various features and structures are used.
Decisions can now be grounded in actual patterns of language use in various
situations (such as spoken or written, formal or casual situations).
Teachers can shape instruction based on corpus-based information (e.g., if the focus
of instruction is conversational English, teachers could read corpus investigations on
spoken language to determine which features and grammatical structures are
characteristic of conversational English).
If the focus of instruction is a particular grammatical structure, corpus-based
studies can provide a picture of the range of use of that particular structure,
identifying lexical and pragmatic co-occurrence patterns associated with it.
Learners can be actively involved in exploring corpora; if adequate facilities do
not exist, teachers can bring in printouts or results from corpus searches for use in
the classroom.
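One common form that results from a corpus search take is a keyword-in-context (KWIC) listing, which lines up every hit with the words around it so that co-occurrence patterns become visible. The sketch below is a minimal illustration only: the function kwic, the sample sentences, and the window of three words on each side are all assumptions made for this example.

```python
import re

def kwic(text, keyword, width=3):
    """Return keyword-in-context lines: up to `width` words either side of each hit."""
    # Assumption: a token is any run of letters or apostrophes.
    tokens = re.findall(r"[A-Za-z']+", text)
    lines = []
    for i, tok in enumerate(tokens):
        if tok.lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            # Right-align the left context so the keyword lines up in a column.
            lines.append(f"{left:>25} [{tok}] {right}")
    return lines

# Hypothetical sample text, for illustration only.
sample = "I made a decision yesterday. She made a mistake, and he made an effort."
lines = kwic(sample, "made")
for line in lines:
    print(line)
```

A listing like this could be printed and handed to learners, who can then look at which nouns follow "made" in the sample (decision, mistake, effort).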