Week 1 - Tutorial: Open Source & Git
Sebastian Ebert
April 16, 2015
Today
· Organizational things
· Open source project structure
· Gitlab & Git
· Resources
Organizational Things
· new room for tutorials (TBD)
· warning: programming skills required
· Master's students: email us your name, matriculation number, and email address
· got your Gitlab password?
· check at https://webmail2.cip.ifi.lmu.de
· if necessary, set up email forwarding in CipConf
Open Source Project Structure
· src/: source code
· 01
· 02
· project
· lib/: external libraries
· perl/
· python/
· java/
· …
· doc/: documentation
· README.md: Markdown-formatted readme
· Makefile: routines for compiling and/or installing (useful even for script-only projects!)
· LICENSE: http://choosealicense.com
· .gitignore: https://www.gitignore.io
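The layout above can be sketched as a short shell session. This is only an illustration: the project name `my-project` and the ignore patterns are assumptions, not part of the slides, and the files are created empty as placeholders.

```shell
set -eu

# Hypothetical project name; a temporary directory keeps the sketch free of side effects.
project_dir="$(mktemp -d)/my-project"

# Directories from the list above.
mkdir -p "$project_dir/src" "$project_dir/lib" "$project_dir/doc"

# Top-level files from the list above, created empty as placeholders.
touch "$project_dir/README.md" "$project_dir/Makefile" "$project_dir/LICENSE"

# Example ignore patterns (assumed for illustration, not taken from the slides).
printf '%s\n' '*.pyc' '__pycache__/' '*.class' > "$project_dir/.gitignore"

# List the resulting skeleton, including dotfiles.
ls -A "$project_dir"
```

In a real project the LICENSE and .gitignore contents would come from the two sites linked above rather than being written by hand.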
Git & Gitlab
Why Version Control?
Why Git?
Benefits
· widely used
· fast and easy branching and merging
· free and open source
Drawbacks
Why Gitlab?
Task 1: Groups
· form a group
· 2-3 people
· mix skill levels
Task 2: Setup Gitlab
5. Go to Settings -> Members and add your group members with Developer or Master privileges.
6. Give us (the instructors) access by
· either making the project public at Project -> Visibility Level
· or adding us (David Kaumanns, Sebastian Ebert) as new members with Reporter privileges.
7. Email us the link to the project repository, the group name, and your email addresses.
Git Walk Through
Task 3: Exercise
Idea of Git
Figure 1: http://www.git-scm.com/book/en/v2/Getting-Started-Git-Basics
Let’s do it
Commands you Need
· Make your changes.
· Stage your changes (i.e. tell Git that they exist):
· git add <file name|patterns>
· Commit your changes to your local repository:
· git commit -am "initial commit"
· -a (--all): automatically stage files that have been modified or deleted (only files Git already tracks; new, untracked files still need git add).
· -m (--message): use an inline commit message.
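The stage-and-commit steps above can be sketched as follows. The file names, commit messages, and the name and email passed to `git config` are placeholders chosen for illustration; the sketch runs in a throwaway repository so it does not touch any real project.

```shell
set -eu

# Work in a throwaway repository.
repo="$(mktemp -d)"
cd "$repo"
git init -q .

# Placeholder identity, set for this repository only.
git config user.name "Example Student"
git config user.email "student@example.com"

# A new file is untracked until it is staged with git add.
echo "Hello CIS" > hello.txt
git add hello.txt
git commit -q -m "initial commit"

# hello.txt is now tracked, so -a picks up the modification.
echo "second line" >> hello.txt
# notes.txt is new and untracked, so -a does NOT include it.
echo "scratch" > notes.txt
git commit -q -am "extend hello.txt"

# notes.txt still shows up as untracked ("?? notes.txt").
git status --short
```

The last command illustrates why `-a` is not a substitute for `git add`: it only covers files Git already tracks.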
· Pull regularly to fetch and merge your teammates' changes:
· git pull
· Check differences
· git diff
· if problems with color occur: git config --global core.pager "less -r"
· Review the most recent commit:
· git show
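The pull, diff, and show commands can be tried without a server. In the sketch below a local bare repository stands in for the GitLab remote, and two clones named `alice` and `bob` stand in for two group members; all of these names, the file contents, and the identities are assumptions made for illustration.

```shell
set -eu
work="$(mktemp -d)"
cd "$work"

# A local bare repository stands in for the GitLab remote (assumption).
git init -q --bare remote.git
git --git-dir=remote.git symbolic-ref HEAD refs/heads/master

# First group member: create the initial commit and push it.
git clone -q remote.git alice 2>/dev/null
cd alice
git symbolic-ref HEAD refs/heads/master   # make sure the first branch is called master
git config user.name "Alice Example"
git config user.email "alice@example.com"
echo "first line" > notes.txt
git add notes.txt
git commit -q -m "initial commit"
git push -q origin master

# Second group member: clone the project.
cd "$work"
git clone -q remote.git bob

# First member pushes another change in the meantime.
cd "$work/alice"
echo "second line" >> notes.txt
git commit -q -am "add second line"
git push -q origin master

# Second member: pull the new commit, then review it.
cd "$work/bob"
git pull -q
git --no-pager show --stat

# An uncommitted local edit shows up in git diff.
echo "third line" >> notes.txt
git --no-pager diff
```

`--no-pager` is used here only so the output is printed directly instead of being opened in a pager.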
Branching
· Create new branch:
· git branch awesome-feature
Ready to merge your new feature into the master branch?
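One common way to go from a new branch back to master is sketched below. It is a minimal example under assumptions: the repository, file name, and identity are placeholders, and the merge is the simple fast-forward case where master has not changed in the meantime.

```shell
set -eu

# Throwaway repository with one commit on master.
repo="$(mktemp -d)"
cd "$repo"
git init -q .
git symbolic-ref HEAD refs/heads/master   # make sure the first branch is called master
git config user.name "Example Student"
git config user.email "student@example.com"
echo "base" > app.txt
git add app.txt
git commit -q -m "initial commit"

# Create the branch, switch to it, and commit the feature there.
git branch awesome-feature
git checkout -q awesome-feature
echo "feature" >> app.txt
git commit -q -am "add awesome feature"

# Switch back to master and merge the feature branch into it.
git checkout -q master
git merge -q awesome-feature

# The merged branch is no longer needed.
git branch -d awesome-feature
```

If master had received other commits in the meantime, `git merge` would create a merge commit or report conflicts instead of fast-forwarding.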
Words to remember
Great Git tutorials
· http://www.git-scm.com/doc
· http://gitimmersion.com
Resources
Figure 3: Which resources do you know?
Lexicons
· CISLex
Treebanks
(S
  (NP
    (NNP John)
  )
  (VP
    (VBZ loves)
    (NP
      (NNP Mary)
    )
  )
  (...)
)
· The Penn Treebank Project
· https://www.cis.upenn.edu/~treebank/
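To get a feel for the bracketed format, the tagged words can be pulled out with standard text tools. The sketch below is not a real treebank reader; it assumes a complete one-line tree for the example sentence and relies on the fact that every leaf has the form `(TAG word)`.

```shell
set -eu

# A complete tree for the example sentence, written on one line (assumption).
tree='(S (NP (NNP John)) (VP (VBZ loves) (NP (NNP Mary))))'

# Leaves are the only brackets that contain no nested bracket.
# Extract them, then rewrite "(TAG word)" as "word/TAG".
tagged="$(printf '%s\n' "$tree" \
  | grep -oE '\([A-Z]+ [^()]+\)' \
  | sed -E 's|^\(([A-Z]+) (.+)\)$|\2/\1|')"

# Prints John/NNP, loves/VBZ, Mary/NNP on separate lines.
printf '%s\n' "$tagged"
```

For anything beyond a quick look, a proper tree parser is the better choice, since real treebank files contain tags and tokens that this pattern does not cover.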
Knowledge bases & ontologies
· WordNet
· https://wordnet.princeton.edu
· GermaNet
· http://www.sfs.uni-tuebingen.de/GermaNet
Parallel text corpora
· Europarl
· proceedings of the European Parliament in 21 languages
· http://www.statmt.org/europarl/
· WMT11
· from a shared task
· http://www.statmt.org/wmt11/translation-task.html#download
Questions & answers
Collocations & N-grams
· http://corpus.byu.edu
· English collocations
· http://collocations.ooz.ie/
· CIS Wittfind
· http://wittfind.cis.uni-muenchen.de
Pretrained models & representations
Text corpora
· News articles
· Reuters news: http://trec.nist.gov/data/reuters/reuters.html
· Wall Street Journal: https://catalog.ldc.upenn.edu/LDC94S13A
· North American News Text Corpus: https://catalog.ldc.upenn.edu/LDC95T21
· Wikipedia
Wikipedia
Assignment
Exercise 01 - Hello CIS
Due: Thursday, April 23, 2015, 16:00. The submission tag must point to a commit made before the deadline.
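Tagging a commit and checking what the tag points to could look like the sketch below. The tag name `ex01`, the file, and the identity are hypothetical; use whatever tag name the exercise sheet asks for.

```shell
set -eu

# Throwaway repository with one commit standing in for the solution.
repo="$(mktemp -d)"
cd "$repo"
git init -q .
git config user.name "Example Student"
git config user.email "student@example.com"
echo "Hello CIS" > hello.txt
git add hello.txt
git commit -q -m "solve exercise 01"

# Hypothetical tag name; an annotated tag on the current commit.
git tag -a ex01 -m "submission for exercise 01"

# Show which commit the tag points to and when that commit was made.
git --no-pager log -1 --format='%H %cd' ex01

# With a remote configured, tags have to be pushed explicitly:
# git push origin ex01
```

The commit date printed by the log command is the value to compare against the deadline; creating the tag later does not change it.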
Have fun