How to convert .wav file to text

Question

1.00/5 (1 vote)

See more:

, +

I need to convert a .wav file to text.
What we can hear from .wav file need to write in text file.
Please give solutions for any above technologies.

Posted 28-May-13 19:57pm

Mayur Panchal

Updated 28-May-13 19:59pm

v2

Add a Solution

Comments

Sergey Alexandrovich Kryukov 29-May-13 1:59am

Does not seem to make any sense. What should be written in this text? I can imagine: do, fa dies, sol, cluck, cluck, moo... :-)
—SA

Mayur Panchal 29-May-13 2:02am

Something like speech to text recognition. But it's a .wav file in place of speech.

Sergey Alexandrovich Kryukov 29-May-13 2:07am

Aha, this is what I tried to imagine... :-)
And you did not even try to answer my question. What should be on output? Musical notation? In what system? Modern European equal temperament? (Do you know this one and others?) Even if I assume that, or something different, do you understand that a real record may not obey any fixed system?

You cannot be serious. Do you understand that there are noises? What do you want to do with them? Do you know Fourier analysis well enough? Do you understand how difficult the problem can be and why?

—SA

1 solution

Add a Solution

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Sergey Alexandrovich Kryukov · Accepted Answer · 2013-05-28T20:15:00

Please see my comments to the question. This is my answer.

I also remembered that I tried some open-source products few years ago. Even though the acoustic input was very pure and based on Western equal temperament system (electronic keyboard), none of them demonstrated satisfactory results. It could be considered just experimental effort. I can imagine that by now some products are of near commercial quality (I can imagine the reaction of a good musician; those people can usually catch pretty complex pieces of music on the fly, surpassing any thinkable tools the was we can surpass any language tools in image recognition :-)). I can believe that some considerable progress is possible when working with the team of best developers well familiar with acoustics, mathematics and music, during a number of years. This is really very, very difficult. If you were on such level, you would not probably ask such question.

[EDIT]

Answering after the clarification from the OP.

OK, this is called speech recognition: http://en.wikipedia.org/wiki/Speech_recognition[^].

With .NET, the available technology is easily accessible. You need to use the assembly "speech.dll" (from the GAC, as it is bundled with .NET Framework, so, in Visual Studio, use the tab ".NET" of the "Add Reference" window).

Please see: http://msdn.microsoft.com/en-us/library/system.speech.recognition.aspx[^].

However, don't be too excited. This technology works reasonably well if you simply develop some speech commander for your application, with reasonable number of distinct commands. If you try to perform the free dictation, you can use available dictation grammar: http://msdn.microsoft.com/en-us/library/system.speech.recognition.dictationgrammar.aspx[^].

You can do it, but the results… I would call them frustrating. Anyway, reportedly, even the dictation technology of reasonable quality is commercially available. Maybe it can become a commonplace soon enough…

—SA