Welcome to the Lounge

   

For discussing anything related to a software developer's life, but not for programming questions. Got a programming question?

The Lounge is rated Safe For Work. If you're about to post something inappropriate for a shared office environment, then don't post it. No ads, no abuse, and no programming questions. Trolling (political, climate, religious or whatever) will result in your account being removed.

 
General: Re: Quiet day today, isn't it?
OriginalGriff, 21-Dec-19 6:26
General: Re: Quiet day today, isn't it?
Kornfeld Eliyahu Peter, 21-Dec-19 6:40
General: Re: Quiet day today, isn't it?
Cp-Coder, 21-Dec-19 7:03
General: Re: Quiet day today, isn't it?
RickZeeland, 21-Dec-19 9:57
General: Re: Quiet day today, isn't it?
Mark_Wallace, 21-Dec-19 10:19
General: Re: Quiet day today, isn't it?
Richard MacCutchan, 21-Dec-19 21:41
General: Re: Quiet day today, isn't it?
GenJerDan, 22-Dec-19 21:55
General: Paging Mr. Cthulhu...
PIEBALDconsult, 20-Dec-19 16:19
Mr. Cthulhu, please pick up a white courtesy phone in the Lounge.

Sooo... as part of my problem dealing with a "bad" internal web site (PHP donthaknow) -- Ignoring certificate errors with HTTPS[^] -- I spent much of the last week Parsing Html The Cthulhu Way[^].

It turned out that by using a System.Windows.Forms.WebBrowser and retrieving the Document, I was actually getting a sanitized version of the HTML (and only the body). This is a problem, because when common sense breaks out, there will be no certificate error, no reason to use the System.Windows.Forms.WebBrowser, and I expect that I will then receive the entire nasty pile of HTML in its raw form (fingers crossed).

So, this week I looked into accessing the raw HTML from the System.Windows.Forms.WebBrowser ... I accessed its privates, and grabbed it by the primary Interop assembly. And, by gum, it worked. :-D

"What did you find?" I hear you ask. It's more what I didn't find. The page contains most of a TABLE (as expected), but a few start tags are missing :OMG: -- unimportant ones, like THEAD, TR, and TH.

Can you then fault me for summoning Cthulhu? What self-respecting HTML parser will deal with such a mess? (Other than IE, of course).

(Deep breath.) I spent today wrestling with HtmlAgilityPack, which dealt pretty well with the errors (TagNotOpened), and I managed to use the errors to insert the missing start tags where they logically belong. :-D Nifty. Perfect effort for the last day before a week off.
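
For anyone curious what "end tag with no start tag" detection looks like in miniature, here is a rough sketch. It is not HtmlAgilityPack and not the code described above: it uses Python's standard `html.parser` purely as an illustration, and the names `UnopenedTagFinder` and `find_unopened_tags` are made up for this example. It only reports the orphan end tags and where they sit; deciding where each missing start tag "logically belongs" is left out.

```python
from html.parser import HTMLParser

# Void elements never take an end tag, so they are never pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class UnopenedTagFinder(HTMLParser):
    """Collects end tags that have no matching start tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.open_tags = []   # stack of currently open element names
        self.unopened = []    # (tag, line, column) for each orphan end tag

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.open_tags.append(tag)

    def handle_startendtag(self, tag, attrs):
        # A self-closed tag such as <br/> is balanced by definition; ignore it.
        pass

    def handle_endtag(self, tag):
        if tag in self.open_tags:
            # Pop back to the matching start tag, discarding unclosed children.
            while self.open_tags.pop() != tag:
                pass
        else:
            # getpos() gives the position of the tag currently being handled.
            line, column = self.getpos()
            self.unopened.append((tag, line, column))


def find_unopened_tags(html):
    """Return a list of (tag, line, column) for end tags that were never opened."""
    finder = UnopenedTagFinder()
    finder.feed(html)
    finder.close()
    return finder.unopened


if __name__ == "__main__":
    # A table that is missing its THEAD, TR, and TH start tags.
    sample = "<table>\nName</th></tr></thead>\n<tr><td>x</td></tr>\n</table>"
    for tag, line, column in find_unopened_tags(sample):
        print(f"end tag </{tag}> at line {line}, column {column} was never opened")
```

The reported positions are the raw material for a repair pass: each one marks an end tag whose start tag has to be inserted somewhere earlier in the text.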

So, provided I can deploy HtmlAgilityPack to the server, I may be able to cancel my summons. In the meantime, I have the RegEx version on the server.


Now, at the risk of asking a Programming Question... does anyone know how to get HtmlAgilityPack to report TagNotClosed errors as well? It has an error type for it, but I haven't gotten it to report any.
General: Re: Paging Mr. Cthulhu...
BillWoodruff, 21-Dec-19 18:53
General: Why can't you set up an email rule...
dandy72, 20-Dec-19 10:58
General: Re: Why can't you set up an email rule...
Mark_Wallace, 20-Dec-19 12:03
Answer: Re: Why can't you set up an email rule...
Ravi Bhavnani, 20-Dec-19 12:39
General: :D
Eddy Vluggen, 20-Dec-19 15:15
General: Most of the time....
Marc Clifton, 20-Dec-19 9:54
General: Re: Most of the time....
Forogar, 20-Dec-19 10:20
General: Re: Most of the time....
Marc Clifton, 20-Dec-19 14:26
General: Re: Most of the time....
Gerry Schmitz, 20-Dec-19 17:23
General: From the Microsoft Health and Safety info
OriginalGriff, 20-Dec-19 8:49
General: Re: From the Microsoft Health and Safety info
lopatir, 20-Dec-19 9:11
General: Re: From the Microsoft Health and Safety info
OriginalGriff, 20-Dec-19 9:42
General: Re: From the Microsoft Health and Safety info
Mark_Wallace, 20-Dec-19 12:06
General: Re: From the Microsoft Health and Safety info
Marc Clifton, 20-Dec-19 9:35
General: Re: From the Microsoft Health and Safety info
OriginalGriff, 20-Dec-19 9:47
General: Re: From the Microsoft Health and Safety info
dandy72, 20-Dec-19 9:52
General: Re: From the Microsoft Health and Safety info
OriginalGriff, 20-Dec-19 10:38