Unedited videos are full of verbal disfluencies ("huh", "uh", "erm", "um") and long pauses when the speaker is thinking of what to say. Editing such videos manually is tedious and time-consuming.
- Removes glaring disfluencies and hesitations.
- Reduces the duration of pauses without cuts.
- Returns a final cut that is cleaner and shorter than the original .mp4 upload.
- Some of your computer's CPU processing resources
- Time
You can find the deployed app here.
| Video size/MB | Video duration/min | Final video duration/min | Audio analysis time/min | Video editing time/min |
|---|---|---|---|---|
| 21 | 1.04 | 0.53 | 0.4 | 11 |
| 130 | 6.44 | 5.04 | 3.53 | 73 |
- Sign in with Google.
- Click Browse to upload a .mp4 video (max size: 100 MB).
- Click Analyze Video to start video analysis.
- Click Clean Video when the progress bar reaches 50% to start video processing. Colored bars will appear below the video to indicate the type of speech (speech, hesitation, pause) that occurred in each part of the video's timeframe. The bar on the right visualises the cleaned state without hesitations and long pauses (see the sketch after this list).
- Click Download when the progress bar reaches 100%.
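The cleaning step itself boils down to a single pass over the labelled segments. Below is a minimal TypeScript sketch of that idea, not the app's actual code: the `Segment` shape and the `MAX_PAUSE_SECONDS` threshold are hypothetical.

```ts
// Sketch: given labelled segments from the audio analysis, decide which time
// ranges to keep. Hesitations are dropped entirely; pauses are trimmed to a
// maximum length; speech is kept as-is.
type SegmentType = "speech" | "hesitation" | "pause";

interface Segment {
  type: SegmentType;
  start: number; // seconds
  end: number;   // seconds
}

const MAX_PAUSE_SECONDS = 0.5; // hypothetical threshold

function keepRanges(segments: Segment[]): Segment[] {
  const kept: Segment[] = [];
  for (const seg of segments) {
    if (seg.type === "hesitation") continue; // cut "um"/"uh" entirely
    if (seg.type === "pause") {
      // shorten long pauses instead of removing them outright
      kept.push({ ...seg, end: Math.min(seg.end, seg.start + MAX_PAUSE_SECONDS) });
    } else {
      kept.push(seg);
    }
  }
  return kept;
}
```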
- [ ] Fix login bugs
- [ ] Nicer login UI
- [ ] Port from Next.js + Firebase to Heroku
See the open issues for a full list of proposed features (and known issues).
Next.js was chosen on a whim, as we wanted to explore frameworks other than Create React App. Unexpectedly, Vercel serverless functions have a timeout of 15 s, which cuts the request off before a transcription result is received from IBM Watson Speech to Text.
The timeout was extended to 540 s when the IBM Speech to Text call was moved to Firebase Cloud Functions. This is sufficient for proof-of-concept tests of short videos under 9 minutes (the current approach).
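For reference, here is a minimal sketch of requesting the longer timeout with the 1st-gen `firebase-functions` API; the function name `transcribe` and the handler body are placeholders, not this project's actual code.

```ts
// Sketch: raise the Cloud Functions timeout to its 540 s maximum.
import * as functions from "firebase-functions";

export const transcribe = functions
  .runWith({ timeoutSeconds: 540, memory: "1GB" })
  .https.onRequest(async (req, res) => {
    // ...call IBM Watson Speech to Text here (placeholder)...
    res.status(200).send("ok");
  });
```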
To overcome timeouts entirely, a Create React App frontend, an Express backend, and the IBM asynchronous API would be more suitable. The resulting pipeline would be much simpler than the one shown above.
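A rough sketch of what that alternative could look like with the `ibm-watson` Node SDK behind Express; the routes, file path, and environment variable names are assumptions for illustration only.

```ts
// Sketch: submit an asynchronous recognition job and poll it, so no single
// HTTP request has to outlive the transcription.
import express from "express";
import SpeechToTextV1 from "ibm-watson/speech-to-text/v1";
import { IamAuthenticator } from "ibm-watson/auth";
import fs from "fs";

const stt = new SpeechToTextV1({
  authenticator: new IamAuthenticator({ apikey: process.env.STT_APIKEY ?? "" }),
  serviceUrl: process.env.STT_URL ?? "",
});

const app = express();

// Kick off an asynchronous recognition job and return its id immediately.
app.post("/jobs", async (_req, res) => {
  const { result } = await stt.createJob({
    audio: fs.createReadStream("upload.mp3"), // placeholder path
    contentType: "audio/mp3",
    timestamps: true,
  });
  res.json({ jobId: result.id });
});

// Poll the job; results are attached once its status is "completed".
app.get("/jobs/:id", async (req, res) => {
  const { result } = await stt.checkJob({ id: req.params.id });
  res.json(result);
});

app.listen(3001);
```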
We started using this API because a very early idea for succinct cut was to edit videos by deleting words from a transcript. The idea was simplified, as that was hard to accomplish in two weeks. We nevertheless proceeded with the API because it gives us timestamps of hesitations, which, depending on the video, can be an insignificant feature.
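For illustration, hesitation timestamps can be extracted from a Watson Speech to Text response that was requested with `timestamps: true` (Watson labels filler words as `%HESITATION`); the `SttResponse` type and `hesitationRanges` helper below are hypothetical, not code from this repo.

```ts
// Sketch: pull hesitation time ranges out of a Watson Speech to Text response.
interface SttResponse {
  results: {
    alternatives: { timestamps?: [string, number, number][] }[];
  }[];
}

function hesitationRanges(response: SttResponse): [number, number][] {
  const ranges: [number, number][] = [];
  for (const result of response.results) {
    // Each timestamp entry is [word, startSeconds, endSeconds].
    for (const [word, start, end] of result.alternatives[0]?.timestamps ?? []) {
      if (word === "%HESITATION") ranges.push([start, end]);
    }
  }
  return ranges;
}
```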
Had we decided from the start to process the video based only on pauses/long silences, IBM Watson Speech to Text would not be needed. AudioContext could be used to detect silences, and the lengthy audio-analysis stage could be shortened.
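A sketch of that alternative using the Web Audio API in the browser; the `findSilences` helper and its thresholds are illustrative, not code from this repo.

```ts
// Sketch: detect long silent stretches client-side by scanning decoded
// audio samples for low amplitude, avoiding any transcription service.
async function findSilences(
  file: File,
  threshold = 0.01,    // amplitude below this counts as silence
  minSilenceSec = 1.0  // only report pauses at least this long
): Promise<[number, number][]> {
  const ctx = new AudioContext();
  const buffer = await ctx.decodeAudioData(await file.arrayBuffer());
  const data = buffer.getChannelData(0);
  const silences: [number, number][] = [];
  let start: number | null = null;

  for (let i = 0; i < data.length; i++) {
    const quiet = Math.abs(data[i]) < threshold;
    if (quiet && start === null) start = i;
    if (!quiet && start !== null) {
      if ((i - start) / buffer.sampleRate >= minSilenceSec) {
        silences.push([start / buffer.sampleRate, i / buffer.sampleRate]);
      }
      start = null;
    }
  }
  return silences;
}
```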
Although the chosen methods were less than ideal, there was a lot to learn from deploying on Vercel, Firebase, and IBM Watson Speech to Text.
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!
- Fork the Project
- Create your Feature Branch (`git checkout -b feature/AmazingFeature`)
- Commit your Changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the Branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
Distributed under the MIT License. See LICENSE.txt for more information.
Jia En - @ennnm_ - jiaen.1sc4@gmail.com

Shen Nan - @wongsn - wongshennan@gmail.com
