There are some subtle differences to the original Nvidia implementation

#1
by hlevring - opened

https://huggingface.co/spaces/ysdede/parakeet.js-demo

W Canvid, every recording becomes smarter and easier to work with. It automatically generates a full transcript of your video, complete with time-accurate word timestamps. You can simply click any word to jump straight into that exact moment. Perfect for precise edits and effortless navigation. Canvid's cancellation cleans up your audio by removing background noise and enhancing your voice in real time. You can also use AI background blur or full background removal to stay focused and fully immersed in your presentation. And if you make a small mistake, no problem. With AI audio retakes, just highlight the part you want to fix, then record or type your correction. No need to re-record the entire video. Best of all, even without a live webcam, Canvid can create a realistic synthetic camera feed from your best previous recording, giving your videos a professional, personal touch every time....

https://huggingface.co/spaces/nvidia/parakeet-tdt-0.6b-v2

With Canvid, every recording becomes smarter and easier to work with. It automatically generates a full transcript of your video, complete with time-accurate word timestamps. You can simply click any word to jump straight into that exact moment. Perfect for precise edits and effortless navigation. Canvid's noise cancellation cleans up your audio by removing background noise and enhancing your voice in real time. You can also use AI background blur or full background removal to stay focused and fully immersed in your presentation. And if you make a small mistake, no problem. With AI audio retakes, just highlight the part you want to fix, then record or type your correction. No need to re-record the entire video. Best of all, even without a live webcam, Canvid can create a realistic synthetic camera feed from your best previous recording, giving your videos a professional, personal touch every time.

--- ysdede/parakeet.js-demo
+++ nvidia/parakeet-tdt-0.6b-v2
@@ -1,1 +1,1 @@
-W Canvid, every recording becomes smarter and easier to work with.
+With Canvid, every recording becomes smarter and easier to work with.

@@
-Canvid's cancellation cleans up your audio by removing background noise and enhancing your voice in real time.
+Canvid's noise cancellation cleans up your audio by removing background noise and enhancing your voice in real time.

@@
-giving your videos a professional, personal touch every time....
+giving your videos a professional, personal touch every time.

(Using webgpu float32)

I could try to have a look, but perhaps you might have some idea already?

Hi. Thanks for your feedback. It is fixed now.
https://github.com/ysdede/parakeet.js/issues/6

ysdede changed discussion status to closed

Sign up or log in to comment