Skip to content

Masters project for the University of Washington's Master of Science in Computational Linguistics program. Concerns genre classification of audio recordings for low resource languages.

License

Notifications You must be signed in to change notification settings

adoxography/streamlined-genre

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

streamlined-genre

Installation

  1. Ensure that python (>=3.6), Java, and openSMILE are installed. python3, java, and SMILExtract should be available on the PATH.

  2. Ensure that pip, wheel, and setuptools are installed and up-to-date

pip install --upgrade pip wheel setuptools

  1. Clone the repository and cd into it

git clone https://github.com/adoxography/streamlined-genre && cd streamlined-genre

  1. Install the streamlined-genre package

pip install .

If you're developing streamlined-genre, you might want to install it with the optional dev dependencies instead:

pip install -e ".[dev]"

Usage

Data preparation

ELAR

streamlined-genre is designed to work with directories generated from Endangered Languages Archive (ELAR) content. The directories should be constructed as follows:

{LANGUAGE}
├── Bundles
│   └── {RECORDING_TITLE}
│       └── {AUDIO_FILE}
└── {LANGUAGE}_ELAR_Directory.csv

{LANGUAGE}_ELAR_Directory.csv should be structured as follows, where {LABEL} is the instance's genre label. Audio files may be in wav or sph format.

...
{RECORDING_TITLE},...,{AUDIO_FILE},...{LABEL}
...

Non-ELAR

streamlined-genre can also work with arbitrary wav files. The files should be in the same folder and be named {IDENTIFIER}__{LABEL}.wav (where {IDENTIFIER} is unique across all samples).

Command line executable

Once installed, the executable streamlined-genre is available on the command line. It requires at least one action flag:

Flag Action Requires
-t Transfers audio files from ELAR directories --wavs and at least one --source
-e Extracts LLDs and labels from wav files --wavs and --compiled
-x Compiles bags of words from LLD files --compiled
-c Classifies bags of words --compiled

Action flags may be combined; e.g. -texc.

For a full description of command line arguments, run streamlined-genre --help.

License

This project is licensed GPL v.3. It incorporates the following third party packages, which have their own licenses:

About

Masters project for the University of Washington's Master of Science in Computational Linguistics program. Concerns genre classification of audio recordings for low resource languages.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages