LEL2B: MapTask files for LEL2B - Tutorial/07 Tuesday 12:10, Brandon Kieffer



Below are the set of dialogue files for your tutorial group. Each student should annotate roughly 100-300 lines of dialogue from the file(s) assigned to them. If your file(s) have more than 300 lines, you only need to annotate 100-300 lines -- but you're welcome to do more!

Deadline for annotation: 12pm NOON on Friday 14 November. Submit your annotations directly to Hannah (Hannah.Rohde@ed.ac.uk).

What's in each file? The first row contains column headers describing the contents of that column (e.g., "dialogue" gives the name of the dialogue, "familiarity" indicates whether the speakers knew each other or not). The next 13 rows show the practice sample dialogue from class; this just provides a reminder of what you need to do for annotation. The following rows show the dialogue that needs annotation, with empty cells in the columns for the primary annotation ("expression", "referent", "mention", "indefiniteOrDefinite", "isPronoun") and the bonus annotation ("duration").

How to access your files? As will be explained in the week 8 course materials, each student will be working with one or more files. You'll download your file(s) onto your computer, open http://sheets.google.com, open a blank spreadsheet, and then go to the spreadsheet 'File' menu, select 'Import', find the 'Upload' tab, and select the file you downloaded. You will need to click the 'Import data' button.

What do you need to do? Once the file is open, inspect the top rows to understand the different columns. The first 13 annotated lines of dialogue should look familiar from class. You then need to continue the annotation process for the rest of the file. You have a choice of whether to complete only the primary annotation of referring expressions (finding and annotating the expressions for the referent/mention/indefiniteOrDefinite/isPronoun properties) or to also measure the acoustic duration (looking up identical repeated expressions to measure their durations in the .wav files here: https://groups.inf.ed.ac.uk/maptask/signals/dialogues/). For full details about the annotation process, see the annotation instructions from class. Real data is messy -- note down any questions that come up!

To get started: Click on the link(s) below to download the file(s) you'll be working on.

Student Download links
Cuiq2nc2.txt (290 lines)
Cyriaxq8ec4.txt (303 lines)
Davisonq6nc4.txt (262 lines)
De Almeida Barretoq5ec1.txt (303 lines)
De Witq1ec3.txt (303 lines)
Hanq5nc4.txt (305 lines)
Hoodq7ec2.txt (314 lines)
Jackq8nc5.txt (330 lines)
Jonesq2nc3.txt (333 lines)
Kepinskaq8ec1.txt (338 lines)
Linnellq4nc2.txt (349 lines)
Lodhaq8ec6.txt (381 lines)
Ogawaq7nc2.txt (388 lines)
Petrouq4nc6.txt (391 lines)
Stanburyq1nc2.txt (411 lines)