Click here to Skip to main content
15,907,906 members
Please Sign up or sign in to vote.
1.80/5 (2 votes)
See more:
Hi All,
I am working on a site scrapping tool, I am using C# class "WebClient" to download site content. After some time(usually 5-10 minutes) some sites response with captcha image, Is there any way to bypass the captcha. I have tried to use different free proxies but many of them dont work. I have also tried to insert fair amount of time interval between consecutive calls but all in vain.

Is there any solution for that problem or there is some inherent problem in my methodology?

Thanks
Posted
Comments
Richard MacCutchan 18-May-12 6:29am    
Captchas are put on websites specifically to protect them from automated access like the above. If there was a simple way to bypass them then they would not be of any use.

No, it's is an inherent problem in your methodology

If there was, the Captcha system would quickly be updated to prevent you getting round it.

That is exactly what captcha is there for: to make sure it is a human being browsing, not a web bot.
 
Share this answer
 
You can use the API of KnowCaptcha to solve the problem.
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900