|
If I had one file, I wouldn't have asked. I have at least 100 files, and have heard them once at least, and would like to find whether certain keywords occur in them. Something like batch-processing is what I an looking for. That's why I asked.
|
|
|
|
|
Fur enuff
Life should not be a journey to the grave with the intention of arriving safely in a pretty and well-preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming “Wow! What a Ride!" - Hunter S Thompson - RIP
|
|
|
|
|
Run the files through a (batch) speech recognition program and dump the whole thing to a text file. You can (re)search the text without having to go back to the mp3. Looking at a file in its entirety will help in deciding how to "train" the (ML) process.
Transcribe your recordings
"Before entering on an understanding, I have meditated for a long time, and have foreseen what might happen. It is not genius which reveals to me suddenly, secretly, what I have to say or to do in a circumstance unexpected by other people; it is reflection, it is meditation." - Napoleon I
|
|
|
|
|
Yes and no.
The similarities can be washed out by the noise in the sentence, particularly with all of the different accents/varieties of English that are spoken around the world.
Also, just identifying word endings can be tricky. Picking out words from "rainy days" can give "rainy days", "ray knee days" or "ray need as'" without changing the sound.
|
|
|
|
|
The answer in that link leads to the VOSK project[^], which in turn has a page that lists several projects that integrate the VOSK toolkit. One of those projects is pretty much exactly what the OP is asking for: mp4grep[^]. Looks like it's Linux-only, and using it is a two-step process unless you happen to be starting with a 16khz wav file, but it appears to automate the process of using speech recognition to generate a timestamped transcript, and then grep'ing the result to find the word or expression of interest. I didn't actually try to use it, so I can't vouch for its quality, but it looks pretty neat if it works!
|
|
|
|
|
use a text to speech tool that generates .VTT close captioning files. Each snippet of a few words will have a timestamp to indicate when it's displayed.
Azure has an API for it, and I think AWS has recently upgraded their video to text processing tools to offer the same.
Did you ever see history portrayed as an old man with a wise brow and pulseless heart, weighing all things in the balance of reason?
Is not rather the genius of history like an eternal, imploring maiden, full of fire, with a burning heart and flaming soul, humanly warm and humanly beautiful?
--Zachris Topelius
|
|
|
|
|
To get a feeling for how reliable such systems are, get yourself a YouTube account so that you can upload a few files there, and play them back selecting 'Auto generated' subtitles.
A few years ago, that function tried to autogenerate subtitles even from instrumental music, which could lead to some really funny results. Today, it just says [music], if it cannot detect any vocal part. For vocal music, it quite often misinterprets. For speech, it is surprisingly good, as long as the background noise level is low, the speech is distinct and only one person at a time is speaking.
For English only, of course. (I assume that those who really put resources into this kind of stuff also has high performance versions for Russian speech, but don't expect that to be released to the civilian society .)
|
|
|
|
|
I'm well aware of the limitations in computerized speech to text. The versions offered as services by the big tech companies are the least bad ones available though.
I've never looked into generating non-english transcripts/captions.
Did you ever see history portrayed as an old man with a wise brow and pulseless heart, weighing all things in the balance of reason?
Is not rather the genius of history like an eternal, imploring maiden, full of fire, with a burning heart and flaming soul, humanly warm and humanly beautiful?
--Zachris Topelius
|
|
|
|
|
MP3 to text conversion would be step 1?
"A little time, a little trouble, your better day"
Badfinger
|
|
|
|
|
I suggest you google "speech to text" that is the industry terminology.
"If we don't change direction, we'll end up where we're going"
|
|
|
|
|
assuming you don't have a life what a great research project.
Me? I just want to learn how to hang interior doors square, but we all have our goals.
Charlie Gilley
“They who can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety.” BF, 1759
Has never been more appropriate.
|
|
|
|
|
Would it be enough to use a program that can do transcripts, and then you can search through the resulting text?
No timestamps, but whether that's sufficient for you depends on the use case...
|
|
|
|
|
|
Thanks a lot. Will download and take a look.
|
|
|
|
|
You are welcome.
|
|
|
|
|
Arrr, mateys! It be Talk loik a Pirrrrate day agin!
Do pirates go to the Apple store to get a new iPatch?
Arrr 3.14 men at sea πrates?
I do hope Charlie remembers what day it is traditionally when he makes his eulogy ...
"I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
"Common sense is so rare these days, it should be classified as a super power" - Random T-shirt
AntiTwitter: @DalekDave is now a follower!
|
|
|
|
|
Are there that many ex-services here that understand the R&R? I may be mistaken but have never heard it used by civvies before....
Who the f*** is General Failure, and why is he reading my harddisk?
|
|
|
|
|
IIRrrC, they even used it in Starrr Trrrek a couple of times ...
"I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
"Common sense is so rare these days, it should be classified as a super power" - Random T-shirt
AntiTwitter: @DalekDave is now a follower!
|
|
|
|
|
R&R is a well understood expression here in US.
"A little time, a little trouble, your better day"
Badfinger
|
|
|
|
|
I would assume that the service you are referring to is the Arrrmy?!?!
If you can't laugh at yourself - ask me and I will do it for you.
|
|
|
|
|
I suppose all (Brit) services use it (we in the Army did)...
just I haven't heard the expression in the last 30 odd years or so in a civvie context...
Getting too old I suppose and no longer have my fingers on the pulse of life
Who the f*** is General Failure, and why is he reading my harddisk?
|
|
|
|
|
“If there’s a man among ye, ye’ll come up and fight like the man ye are to be!” - Mary Read
I don't speak Idiot - please talk slowly and clearly
"I have sexdaily. I mean dyslexia. Fcuk!"
Driven to the arms of Heineken by the wife
|
|
|
|
|
I thought that was Mary Berry?
"I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
"Common sense is so rare these days, it should be classified as a super power" - Random T-shirt
AntiTwitter: @DalekDave is now a follower!
|
|
|
|
|
Time for me to row the boat -- Treasure awaits (every other Tuesday, minus taxes)!
|
|
|
|
|
Ye smell like ye been swimmin' in the bilge, ye feckless, mangy sea snake! ... Where's the grog?!
|
|
|
|