LEL2B: MapTask files for LEL2B - Tutorial/02 Monday 14:10, Brandon Kieffer



Below is the set of dialogue files for your tutorial group. Each student should annotate roughly 100-300 lines of dialogue from the file(s) assigned to them. If your file(s) have more than 300 lines, you only need to annotate 100-300 of them, but you're welcome to do more!

Deadline for annotation: 12pm NOON on Friday 14 November. Submit your annotations directly to Hannah (Hannah.Rohde@ed.ac.uk).

What's in each file? The first row contains column headers describing the contents of that column (e.g., "dialogue" gives the name of the dialogue, "familiarity" indicates whether the speakers knew each other or not). The next 13 rows show the practice sample dialogue from class; this just provides a reminder of what you need to do for annotation. The following rows show the dialogue that needs annotation, with empty cells in the columns for the primary annotation ("expression", "referent", "mention", "indefiniteOrDefinite", "isPronoun") and the bonus annotation ("duration").
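If you want to check this layout before importing, the split into header row, practice rows, and rows to annotate can be sketched in Python. This is only an illustration under assumptions: the files are assumed to be tab-separated (the format is not stated here), and the function name `split_rows` and the demo values are made up for the example.

```python
import csv
import io

SAMPLE_ROWS = 13  # practice sample rows that follow the header row


def split_rows(text, delimiter="\t"):
    """Split file contents into (header, sample_rows, rows_to_annotate).

    Assumes one header row, then SAMPLE_ROWS practice rows, then the
    dialogue that still needs annotation. The tab delimiter is an assumption.
    """
    # QUOTE_NONE keeps quotation marks in the dialogue text as literal characters.
    reader = csv.reader(io.StringIO(text), delimiter=delimiter, quoting=csv.QUOTE_NONE)
    rows = list(reader)
    header, body = rows[0], rows[1:]
    return header, body[:SAMPLE_ROWS], body[SAMPLE_ROWS:]


if __name__ == "__main__":
    # Tiny in-memory stand-in for a real file (hypothetical values).
    lines = ["dialogue\tfamiliarity\texpression"]
    lines += [f"practice\tyes\texample {i}" for i in range(SAMPLE_ROWS)]
    lines += ["demo\tno\t", "demo\tno\t"]  # empty "expression" cells
    header, sample, todo = split_rows("\n".join(lines))
    print("columns:", header)
    print("practice rows:", len(sample))
    print("rows to annotate:", len(todo))
```

To try it on a downloaded file, read the file's contents into a string (for example with `open(...).read()`) and pass that string to `split_rows`.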

How to access your files? As will be explained in the week 8 course materials, each student will be working with one or more files. Download your file(s) onto your computer, open http://sheets.google.com, and create a blank spreadsheet. Then go to the spreadsheet's 'File' menu, select 'Import', open the 'Upload' tab, and select the file you downloaded. Finally, click the 'Import data' button.

What do you need to do? Once the file is open, inspect the top rows to understand the different columns. The first 13 annotated lines of dialogue should look familiar from class. You then need to continue the annotation process for the rest of the file. You have a choice of whether to complete only the primary annotation of referring expressions (finding and annotating the expressions for the referent/mention/indefiniteOrDefinite/isPronoun properties) or to also measure the acoustic duration (looking up identical repeated expressions to measure their durations in the .wav files here: https://groups.inf.ed.ac.uk/maptask/signals/dialogues/). For full details about the annotation process, see the annotation instructions from class. Real data is messy -- note down any questions that come up!
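For the bonus annotation, one way to find candidates for duration measurement is to list the expressions that occur more than once in your annotated rows. The sketch below is only an illustration: it assumes each row is a dictionary keyed by column header (as `csv.DictReader` would give you), and it treats expressions as identical when they match after trimming spaces and ignoring case, which is an assumption rather than the rule from the annotation instructions. The function name `repeated_expressions` and the demo rows are hypothetical.

```python
from collections import defaultdict


def repeated_expressions(rows, column="expression"):
    """Map each expression that occurs more than once to its row positions.

    rows: list of dicts keyed by column header.
    Matching ignores case and surrounding whitespace (an assumption).
    """
    positions = defaultdict(list)
    for index, row in enumerate(rows):
        text = (row.get(column) or "").strip().lower()
        if text:  # skip rows with no annotated expression
            positions[text].append(index)
    return {text: found for text, found in positions.items() if len(found) > 1}


if __name__ == "__main__":
    # Hypothetical annotated rows, for illustration only.
    demo_rows = [
        {"expression": "the old mill"},
        {"expression": ""},
        {"expression": "The old mill"},
        {"expression": "a bridge"},
    ]
    print(repeated_expressions(demo_rows))
```

The row positions it returns tell you where in your spreadsheet to look; the durations themselves still have to be measured in the .wav files.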

To get started: Click on the link(s) below to download the file(s) you'll be working on.

Student            Download links
Acharyya           q5ec8.txt (177 lines)
                   q5nc3.txt (124 lines)
Allen              q6ec4.txt (163 lines)
                   q8nc3.txt (137 lines)
De'Ath             q2ec3.txt (214 lines)
                   q4ec1.txt (86 lines)
Hampton            q8ec7.txt (159 lines)
                   q8ec2.txt (156 lines)
Jensen             q1nc4.txt (240 lines)
R. Li              q2ec5.txt (227 lines)
Z. Li              q6ec7.txt (166 lines)
                   q7nc3.txt (162 lines)
Liu                q2nc4.txt (188 lines)
                   q7nc8.txt (125 lines)
Mayoral Vaquero    q3ec1.txt (243 lines)
Quin               q6ec8.txt (177 lines)
                   q4ec6.txt (126 lines)
Thompson           q1ec5.txt (167 lines)
                   q5nc8.txt (137 lines)
Young              q3nc4.txt (179 lines)
                   q3ec4.txt (168 lines)