Here are some of the materials we have produced in the course of our work in Casamance, click on the links to download:
We are in the course of finalizing a corpus of multilingual language use data.This rich annotated multimedia corpus of multilingual language use at the Crossroads will be the centre piece of our joint research. Since the main phase of data collection is over, the bulk of data is already deposited to an online repository. To date, the corpus contains a total of 516 session bundles, including ca. 150 hours of time-aligned, transcribed and translated speech data with ELAN annotations for speaker and language. Of this total number of hours of recorded spoken data, 95 hours were obtained within the lifespan of the current project. Each session bundle consists of a folder, labeled with a mnemonic session name that indicates the location in which the recording took place, the date, and the researcher’s initials, minimally of a sound file and its transcription and annotation , together with a metadata file which lists the languages spoken and the participants in the recording, their age, gender, reported language repertoires, and ethnicity/village. As the corpus coordinator, Abbie Hantgan spent the third project year creating a corpus structure, uploading and unifying data and metadata formats and formalizing common corpus searches through scripts in R. She has edited a corpus manual that, although written for the Crossroads project, might be of use to language documenters and multilingualism researchers interested in possible workflows and detailed scenarios on how to create, harmonize and search field-based corpus. It can be downloaded here:
This is Alexander Cobbinah‘s bilingual dictionary of Baïnounk Gubëeher:
Here are two maps of Brin and Djibonker:
The movie below shows Alpha Naby Mané in a short documentary he created on the use of Ajami writing in Mandinka, which is widely practiced in Casamance.