Click here to Skip to main content
15,886,611 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
See more: , +
I need to convert a .wav file to text.
What we can hear from .wav file need to write in text file.
Please give solutions for any above technologies.
Posted
Updated 28-May-13 19:59pm
v2
Comments
Sergey Alexandrovich Kryukov 29-May-13 1:59am    
Does not seem to make any sense. What should be written in this text? I can imagine: do, fa dies, sol, cluck, cluck, moo... :-)
—SA
Mayur Panchal 29-May-13 2:02am    
Something like speech to text recognition. But it's a .wav file in place of speech.
Sergey Alexandrovich Kryukov 29-May-13 2:07am    
Aha, this is what I tried to imagine... :-)
And you did not even try to answer my question. What should be on output? Musical notation? In what system? Modern European equal temperament? (Do you know this one and others?) Even if I assume that, or something different, do you understand that a real record may not obey any fixed system?

You cannot be serious. Do you understand that there are noises? What do you want to do with them? Do you know Fourier analysis well enough? Do you understand how difficult the problem can be and why?

—SA

1 solution

Please see my comments to the question. This is my answer.

I also remembered that I tried some open-source products few years ago. Even though the acoustic input was very pure and based on Western equal temperament system (electronic keyboard), none of them demonstrated satisfactory results. It could be considered just experimental effort. I can imagine that by now some products are of near commercial quality (I can imagine the reaction of a good musician; those people can usually catch pretty complex pieces of music on the fly, surpassing any thinkable tools the was we can surpass any language tools in image recognition :-)). I can believe that some considerable progress is possible when working with the team of best developers well familiar with acoustics, mathematics and music, during a number of years. This is really very, very difficult. If you were on such level, you would not probably ask such question.

[EDIT]

Answering after the clarification from the OP.

OK, this is called speech recognition: http://en.wikipedia.org/wiki/Speech_recognition[^].

With .NET, the available technology is easily accessible. You need to use the assembly "speech.dll" (from the GAC, as it is bundled with .NET Framework, so, in Visual Studio, use the tab ".NET" of the "Add Reference" window).

Please see: http://msdn.microsoft.com/en-us/library/system.speech.recognition.aspx[^].

However, don't be too excited. This technology works reasonably well if you simply develop some speech commander for your application, with reasonable number of distinct commands. If you try to perform the free dictation, you can use available dictation grammar: http://msdn.microsoft.com/en-us/library/system.speech.recognition.dictationgrammar.aspx[^].

You can do it, but the results… I would call them frustrating. Anyway, reportedly, even the dictation technology of reasonable quality is commercially available. Maybe it can become a commonplace soon enough…

—SA
 
Share this answer
 
v2
Comments
Mayur Panchal 29-May-13 2:27am    
There is no any musical instrument in my .wav file. It's just normal speech of any person.
i.e. Someone specks "This is test" that need to be written in a textbox.
Sergey Alexandrovich Kryukov 29-May-13 2:42am    
It was too hard to understand from your post. I'll edit the answer.
—SA
Sergey Alexandrovich Kryukov 29-May-13 2:49am    
Answered. Please see again, after [EDIT].
—SA
CPallini 29-May-13 3:15am    
5
Sergey Alexandrovich Kryukov 29-May-13 3:16am    
Thank you, Carlo.
—SA

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900