Skip to content

rvhirsch/QuestionDetection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Files

  • audiodata/audioinfo.py: inside audiodata dir, run with $ python3 audioinfo.py
  • audiofiles/splitaudio.py: splits audio into 1 minute increments, puts new files in splitfiles directory
  • textdata/transcribe.py: takes 1 minute wav file and transcribes, splitting by sentence
  • textdata/concatfiles.py: takes all transcriptions and puts them into single file
  • textdata/transcribe.sh: inside textdata dir, run with $ ./transcribe.sh [case-number]

Text

  • sentences_google.csv: from google speech to text transcript
  • sentences_ibm.csv: from ibm speech to text transcript

In each row:

  1. full sentence text
  2. sentence start time in audio files
  3. sentence end time in audio file
  4. corresponding audio file id
  5. label: 0 = statement, 1 = question

Audio

Input: vocal speech - one sentence Output: 13 parameters based on slices of 0.08 sec

  1. min pitch value
  2. max pitch value
  3. pitch range (max-min)
  4. mean pitch
  5. median pitch
  6. check if pitch increases in 2nd half of statement
  7. total pitch increasing
  8. count of increasing slices
  9. total pitch decreasing
  10. count of decreasing slices
  11. check if increasing total > decreasing total
  12. count of nonzero pitches

Audio Data Sources

Important Links

Parameters 1-12 based on: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.563.1655&rep=rep1&type=pdf

Some code based on:

Packages:

Convert MP3 to wav:

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published