|
ah, you use a stack. my pull parsers never have. it's a little faster not to; the only hangup is that without a stack it's possible to let this through: '[ "foo":1 ]', because the ':' after the field name is the only cue you have.
It's the one area where the latest parser of mine is not quite compliant. It *will* error on that, just not as soon as it should.
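To illustrate the point (a minimal Python sketch, not anyone's actual parser -- the token shapes and the `validate_colons` helper are made up): without remembering whether the enclosing container is an object or an array, you have no grounds to reject a `name:value` pair inside an array.

```python
def validate_colons(tokens):
    """tokens: a pre-tokenized JSON stream, e.g. ['[', ('str', 'foo'), ':', 1, ']'].
    Keeps a stack of container kinds purely to know where ':' is legal."""
    stack = []
    for tok in tokens:
        if tok == '[':
            stack.append('array')
        elif tok == '{':
            stack.append('object')
        elif tok in (']', '}'):
            stack.pop()
        elif tok == ':' and (not stack or stack[-1] != 'object'):
            # ':' only belongs between a name and a value inside an object
            raise ValueError("':' not legal outside an object")

validate_colons(['{', ('str', 'foo'), ':', 1, '}'])      # legal JSON: no error
caught = False
try:
    validate_colons(['[', ('str', 'foo'), ':', 1, ']'])  # the '[ "foo":1 ]' case
except ValueError:
    caught = True
```

Drop the stack and the `stack[-1] != 'object'` test has nothing to consult, which is exactly why that input slips through.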
Real programmers use butterflies
|
I think my parser allows that, it trusts that the file is well-formed and doesn't check.
I see no reason to raise an error for that unlikely situation.
Besides, with my parser, every JSONitem has a name (at least an empty one) and a value (and a type), so it doesn't matter whether one is (erroneously) provided or defaulted by my parser.
Now that I think about it more, I don't actually need the Stack.
I could just as easily do something like curr = curr.Parent to step back (up) a level of the tree.
And then the "stack" would be empty when curr is null -- or similar.
Eliminating the Stack probably won't provide a big improvement to the code though.
I'm quite certain any "slowness" is occurring at higher levels, and not in the parser itself.
And, of course, the database access is likely to be the tightest bottleneck.
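A quick sketch of that parent-pointer idea (hypothetical `Node` type, not the poster's actual code): stepping back up with `curr = curr.parent` makes an explicit Stack redundant, and "stack empty" becomes `curr is None`.

```python
class Node:
    """Toy tree node with a parent pointer, standing in for a JSON item."""
    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.children = []
        if parent is not None:
            parent.children.append(self)

root = Node("root")
curr = Node("child", parent=Node("branch", parent=root))

# Walk back up instead of popping a stack.
path = []
while curr is not None:
    path.append(curr.name)
    curr = curr.parent
```

The trade-off is that every node carries a parent reference, so the "stack" is really just distributed across the tree you were building anyway.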
|
DB access times can be improved if you're careful. It pays to check your update times in the DB, because you can often improve them by using things like intermediary in-memory tables without constraints on them, and then updating the "real" table from those transactionally.
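The staging-table idea, sketched with Python's built-in sqlite3 and an in-memory database (table and column names are made up for illustration): bulk-load into an unconstrained staging table, then move everything into the constrained "real" table in a single transaction.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute("CREATE TABLE staging_items (id INTEGER, name TEXT)")  # no constraints

# Fast, constraint-free bulk load into staging.
rows = [(1, "alpha"), (2, "beta"), (3, "gamma")]
conn.executemany("INSERT INTO staging_items VALUES (?, ?)", rows)

# Single transaction: the real table sees all rows or none.
with conn:
    conn.execute("INSERT INTO items SELECT id, name FROM staging_items")
    conn.execute("DELETE FROM staging_items")

loaded = conn.execute("SELECT COUNT(*) FROM items").fetchone()[0]
```

The same shape applies to SQL Server with a temp/staging table plus BulkCopy, just with different plumbing.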
Obviously, profiling is best. I like to time individual things and then check the percentage of time within each operation relative to the others, so I can know overall where improvements can benefit me. For example: DB uses 75% of the time, parsing uses 25%, that kind of thing.
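A toy version of that percentage-of-total approach (the two phases here are stand-ins, not real parse or DB work): time each phase, then compare shares rather than raw numbers.

```python
import time

timings = {}

def timed(label, fn):
    """Run fn, recording its wall-clock duration under label."""
    start = time.perf_counter()
    result = fn()
    timings[label] = time.perf_counter() - start
    return result

timed("parse", lambda: sum(range(100_000)))  # stand-in for the parse phase
timed("db",    lambda: time.sleep(0.01))     # stand-in for the DB phase

total = sum(timings.values())
shares = {k: v / total for k, v in timings.items()}
```

If `shares["db"]` dwarfs `shares["parse"]`, micro-optimizing the parser won't move the needle.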
To add to that: the only thing about dropping the stack is that you have to scan to the end of a string before you can tell whether you're reading a field or a value node, because the ':' is the only thing you can use to discern that without keeping a stack.
Your parsing might be improved wholesale in .NET by ditching JSON parsing altogether and using carefully constructed regular expressions instead.
Real programmers use butterflies
|
honey the codewitch wrote: update times
No updates. Truncate/load only. BulkCopy preferably.
honey the codewitch wrote: tables without constraints
Exactly. I'm loading staging tables for the use of others.
honey the codewitch wrote: you have to scan to the end of a string before you can tell whether you're reading a field or a value node, because the ':' is the only thing you can use to discern that
Well, you have to read to the end of the string/token anyway, and then you can "peek" the next token to see whether or not it's a COLON, no big deal.
Knowing "I'm in an object, therefore this must be a name", or "I'm in an array, therefore this must be a value" is unnecessary complexity.
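That read-then-peek approach, sketched over toy tokens (not a real tokenizer -- `classify_strings` is a made-up helper): each string gets labeled by looking at the single token after it, with no object/array context needed.

```python
def classify_strings(tokens):
    """Label each string token 'name' if the next token is ':', else 'value'."""
    labels = []
    for i, tok in enumerate(tokens):
        if isinstance(tok, str) and tok.startswith('"'):
            # Peek one token ahead; a COLON means this string was a field name.
            peek = tokens[i + 1] if i + 1 < len(tokens) else None
            labels.append(('name' if peek == ':' else 'value', tok))
    return labels

tokens = ['{', '"id"', ':', '"abc"', '}']
labels = classify_strings(tokens)
```

Here `'"id"'` is followed by a colon so it's a name, and `'"abc"'` is a value -- one token of lookahead does all the work.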
honey the codewitch wrote: using carefully constructed regular expressions instead
Frack no. And that would require loading an entire file into memory, wouldn't it?
|
Oh, that's right, I forgot that .NET's regex engine is in-memory only. I've been using my own DFA regex engine for so long now (it streams) that I didn't even think about that.
Also, sorry, I shouldn't have said update, because I meant load.
The other thing I can think of that might speed it up, depending on the network, is to orchestrate the loader to run on the same server as the DB. But it sounds like you probably don't have that ability; based on what you said before, your environment is restricted. Oh well.
Real programmers use butterflies
|
DFA engines don't typically (if ever) backtrack. Microsoft's is an NFA engine.
DFA engines are faster, but take longer to compile and support fewer kinds of matching. Basically, DFAs support the standard regex constructs ()[^-]*?. but nothing fancy like lazy matching** or atomic zero-width assertions.
** Apparently someone on CP has produced a research DFA regex engine that can do lazy matches by engaging in some sorcery in the way it builds the states for the machines, but typically they cannot.
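Why a DFA can't backtrack even in principle (a toy table-driven matcher, assumed for illustration -- not any real engine): here's a hand-built DFA for the language `[ab]*c`. Each input character causes exactly one state transition, so there is never a choice point to return to.

```python
# Transition table: (state, input char) -> next state.
# State 1 is the only accepting state.
DFA = {
    (0, 'a'): 0,
    (0, 'b'): 0,
    (0, 'c'): 1,
}

def dfa_match(text, start=0, accepting=frozenset({1})):
    """Run the DFA over text; one lookup per character, no backtracking."""
    state = start
    for ch in text:
        state = DFA.get((state, ch))
        if state is None:  # dead state: reject immediately
            return False
    return state in accepting

results = [dfa_match(s) for s in ("c", "abac", "abab", "abd")]
```

Because matching is a single forward pass, the engine can consume a stream one character at a time -- which is also why a streaming DFA engine is possible at all.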
Real programmers use butterflies
|
No Makeup On Zoom[^]
"I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
"Common sense is so rare these days, it should be classified as a super power" - Random T-shirt
AntiTwitter: @DalekDave is now a follower!
|
I have often wondered why they can blur the background but they won't let us blur our faces. The background might at least be interesting.
To err is human to really elephant it up you need a computer
|
sounds like my driver
"Please don't come to my funeral.." Sheldon Cooper
|
Jeffrey... I hope you are picking your nose!
OMG, Turn it off!
Turn it Off!
Zoom Meetings can get out of control pretty quick. The worst thing is the bad audio "ping pong", everyone but ONE person is fine. They get fixed and someone else starts having a problem. LOL. Great Times!
I hope 2021 is AS INTERESTING as 2020!
|
The task I have is pretty basic: generate a PDF label with some barcodes. Getting there is a bit of a mission though, mostly due to the availability (or lack thereof) of tools.
I create an html template and have wkhtmltopdf convert it to pdf. Easy enough, but having precise layout and positioning in html isn't always that easy.
Generating code39 and 128 barcodes is relatively easy with JsBarcode. Except when it doesn't want to display once converted to pdf. Then you find out you have to set both the script and html to utf-8 encoding and then it works.
Generating a 2D pdf417 type barcode is relatively easy with a javascript library, except it fails to display once converted to pdf by wkhtmltopdf. So I find a .Net Core library that can generate the barcode as a png, convert the bytes to a base64 image and use that in the html by replacing placeholder text.
Another hurdle was wkhtmltopdf suddenly becoming very slow after being pretty fast in the past. I finally tracked the issue down to spoolsvc and my default printer being a network printer that's not connected anymore. Once that was removed, the conversion works at a decent speed again.
In short, what should be an easy task had lots of complications and workarounds, some quite weird and difficult to track down, but in the end I learned some interesting things
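The PNG-to-base64 workaround for the pdf417 barcode can be sketched like this (the placeholder name and the PNG bytes are hypothetical): encode the image bytes as a data URI and splice it into the HTML template, so wkhtmltopdf never has to resolve an external image.

```python
import base64

# Template with a placeholder where the barcode image goes.
template = '<img src="{{PDF417_BARCODE}}" alt="pdf417"/>'

# In practice these bytes come from the barcode library's PNG output.
png_bytes = b"\x89PNG\r\n\x1a\n..."

# Embed the image inline as a base64 data URI.
data_uri = "data:image/png;base64," + base64.b64encode(png_bytes).decode("ascii")
html = template.replace("{{PDF417_BARCODE}}", data_uri)
```

Since the image travels inside the HTML itself, the conversion step has no script execution or file resolution left to get wrong.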
modified 23-Dec-20 7:29am.
|
Then add all the fun you can have with Zebra label printers.
Wrong is evil and must be defeated. - Jeff Ello
Never stop dreaming - Freddie Kruger
|
That's why I use Avery label sheets and run 'em through my laser printer.
"I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
"Common sense is so rare these days, it should be classified as a super power" - Random T-shirt
AntiTwitter: @DalekDave is now a follower!
|
Well, for manual work that's better, but if you want to automate a bit it's not so fun any more.
Wrong is evil and must be defeated. - Jeff Ello
Never stop dreaming - Freddie Kruger
|
I haven't had to deal with those too much. I've had to work with the Intermec label printers (with the label done in Crystal Reports), but it wasn't too bad.
|
When you're handling Crystal Reports, everything else is great.
Wrong is evil and must be defeated. - Jeff Ello
Never stop dreaming - Freddie Kruger
|
Double fun when using Clipper!
|
Okay,
so we printed labels to go on Damaged Vehicles being kept outside.
They were rubberized and considered "weather-proof"... (Are you guessing how this ends?)...
Turns out, I never bothered to check the operating temperature of the adhesive (okay, I did, but since it was 10 below freezing, I thought we were fine).
Windchill can get much colder. So, on a windy morning, we go outside to see about 1,000 labels flying around, bunching on the ground and near the fence. Panic sets in...
At this point, the conversation from the other day (when we put the label on, should we remove the GREASE PEN marking of the Lot #?) came to mind, and one was GRATEFUL that we decided to leave it on, just in case.
At this point, we revert to PAPER labels with aggressive adhesive and a larger, more forgiving barcode font.
Oh, don't make me think of labels...
|
The paragraph would be so much easier to read if you split it after every 3-4 lines/sentences.
Just a suggestion.
modified 23-Dec-20 7:34am.
|
Done
|
Thank you.
I was going to delete my post because I realized how much of a jerk I sounded after re-reading it. I will edit the OP to be more civilized.
|
I was thinking I should split it up while I was writing it
|
As soon as someone mentions PDF, I start to panic.
For a standard, it's incredibly difficult, and tooling is sparse or expensive.
Recently went the wkhtmltopdf route too.
|
My condolences.
It has quite a number of 'quirks'.
|