|
Like I said, I was just playing dumb. I fully understand that some of these things are probably out of your control.
|
|
|
|
|
Fair enough. Yeah, JSON is used for big data, for better or worse, like XML was. There are *some* advantages to a lexical format when it comes to transmitting numbers across platforms, but nowadays the binary representations are so standardized that, aside from byte order, it doesn't matter. But people will do what they will do.
Real programmers use butterflies
|
|
|
|
|
honey the codewitch wrote: Does anyone know if GCC will work on Windows without some virtual env like MiniGW installed?
It's probably not the compiler so much as the supplied libc. You could peruse the source for glibc and try to write your own strpbrk() based on that. Though on the odd occasions I've tried to spelunk through the glibc sources, about 50% of the time it becomes a bit like trying to solve a maze in Zork. And, of course, it raises possible GPL issues, so maybe looking at the BSD sources might be a better choice.
Wasn't there a post a few weeks ago about SSE and string ops? Maybe that's a direction to consider if you want to write your own.
On a related note, it might be interesting to see how a WSL instance compares in performance to a Windows native instance. The results of that might be skewed though, if WSL uses virtual disks, so maybe a better comparison would be Linux and Windows, both in a VM on the same host. At least then both instances would have the same virtual disk drivers, so, presumably, the difference would be down to the strpbrk() implementation.
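For anyone tempted to roll their own, the classic portable shape of strpbrk() is a 256-entry membership table: one pass over the accept set, then one lookup per haystack byte. This is just a sketch of the general idea, not a copy of glibc's or BSD's code:

```cpp
#include <cstddef>

// Table-driven strpbrk(): mark each accept character in a 256-entry
// table, then scan the haystack with one table lookup per byte.
const char* my_strpbrk(const char* s, const char* accept) {
    bool table[256] = {};  // zero-initialized membership table
    for (const char* a = accept; *a; ++a)
        table[static_cast<unsigned char>(*a)] = true;
    for (; *s; ++s)
        if (table[static_cast<unsigned char>(*s)])
            return s;  // first character from the accept set
    return nullptr;    // no match before the terminator
}
```

The table build costs a fixed ~256 bytes of stack and a pass over the (usually tiny) accept set, which is why this beats the naive nested-loop version for long scans.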
Keep Calm and Carry On
|
|
|
|
|
I was the one that brought up the SIMD string processing. I may have to create my own SIMD-optimized strpbrk() function for my lib, just for the Windows build.
Oddly enough - and I don't know if this is still true - but Apple's standard libraries and OS calls were heckin fast compared to other major offerings. It was about the only nice thing I could say about them.
I'm pretty sure it's MS's standard libraries that are the problem in this case - specifically strpbrk.
I just wonder why they're not better optimized. I haven't disassembled them yet, but from what I've seen of GCC's (which I *have* disassembled), it's using SIMD pretty much entirely. I doubt Microsoft's is, based on the performance alone, which should otherwise be orders of magnitude faster.
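Since the thread keeps circling SIMD string scanning, here's a rough SSE2 sketch of the idea for the common case of a short accept set. The name `simd_strpbrk` is made up, the long-needle case just falls back to the library, and the unaligned-load caveat in the comments is real - this is an illustration, not production code:

```cpp
#include <emmintrin.h>  // SSE2 intrinsics
#include <cstring>

// Sketch: compare 16 bytes at a time against each character of a short
// accept set, and against '\0' to detect the end of the string.
// CAVEAT: the unaligned loads can read up to 15 bytes past the
// terminator, so the buffer must be padded; a production version would
// align the pointer to 16 bytes first.
const char* simd_strpbrk(const char* s, const char* accept) {
    const size_t n = std::strlen(accept);
    if (n == 0 || n > 4)
        return std::strpbrk(s, accept);  // keep the sketch simple
    __m128i needles[4];
    for (size_t i = 0; i < n; ++i)
        needles[i] = _mm_set1_epi8(accept[i]);  // broadcast one needle char
    const __m128i zero = _mm_setzero_si128();
    for (;; s += 16) {
        const __m128i chunk =
            _mm_loadu_si128(reinterpret_cast<const __m128i*>(s));
        const int term = _mm_movemask_epi8(_mm_cmpeq_epi8(chunk, zero));
        int hits = 0;
        for (size_t i = 0; i < n; ++i)
            hits |= _mm_movemask_epi8(_mm_cmpeq_epi8(chunk, needles[i]));
        if (term | hits) {
            // Walk the bitmasks in byte order to see which came first:
            // a needle character or the terminator.
            for (int i = 0; i < 16; ++i) {
                if (hits & (1 << i)) return s + i;
                if (term & (1 << i)) return nullptr;  // end of string
            }
        }
    }
}
```

SSE2 is baseline on x86-64, so this needs no CPU feature checks there; the real vectorized implementations in libc are considerably more involved than this.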
Real programmers use butterflies
|
|
|
|
|
Just a really dumb question: you're sure you're looking at a Release build, and not a Debug build? I'm not sure that would even matter, unless MS provides an unoptimized libc for debugging?
Keep Calm and Carry On
|
|
|
|
|
I've tried building it in release under several different configurations (different architectures and optimization settings) and I'm not getting much difference, leading me to believe strpbrk() is not optimized using SIMD, unlike GCC's stdlib implementation.
Real programmers use butterflies
|
|
|
|
|
You're not wrong...
Here's some code that scans through a 1GB string (finding a character at the very end of it) with the four equivalent-but-different ways I could think of (std::string::find_first_of, std::string_view::find_first_of, std::find_first_of, and strpbrk):
#include <algorithm>
#include <chrono>
#include <cstring>
#include <iostream>
#include <string>
#include <string_view>

int main()
{
    std::string s(size_t(1024) * 1024 * 1024, ' ');
    s.back() = 'c';

    auto start = std::chrono::steady_clock::now();
    auto x = s.find_first_of("abc");
    auto end = std::chrono::steady_clock::now();
    std::cout << "std::string::find_first_of -> " << x << " in "
              << std::chrono::duration_cast<std::chrono::milliseconds>(end - start).count()
              << " ms" << std::endl;

    start = std::chrono::steady_clock::now();
    std::string_view s_as_view{s.c_str(), s.size()};
    auto x1 = s_as_view.find_first_of("abc");
    end = std::chrono::steady_clock::now();
    std::cout << "std::string_view::find_first_of -> " << x1 << " in "
              << std::chrono::duration_cast<std::chrono::milliseconds>(end - start).count()
              << " ms" << std::endl;

    start = std::chrono::steady_clock::now();
    std::string needle{"abc"};
    auto x2 = std::distance(std::begin(s), std::find_first_of(std::begin(s), std::end(s),
                                                              std::begin(needle), std::end(needle)));
    end = std::chrono::steady_clock::now();
    std::cout << "std::find_first_of -> " << x2 << " in "
              << std::chrono::duration_cast<std::chrono::milliseconds>(end - start).count()
              << " ms" << std::endl;

    start = std::chrono::steady_clock::now();
    auto y = std::distance(s.c_str(), strpbrk(s.c_str(), "abc"));
    end = std::chrono::steady_clock::now();
    std::cout << "strpbrk -> " << y << " in "
              << std::chrono::duration_cast<std::chrono::milliseconds>(end - start).count()
              << " ms" << std::endl;
}
and here's the output when compiled with cl.exe -std:c++17 -Ob2 -O2 -Os -EHsc a.cpp and run on the i7-6820HQ in my work laptop:
std::string::find_first_of -> 1073741823 in 552 ms
std::string_view::find_first_of -> 1073741823 in 557 ms
std::find_first_of -> 1073741823 in 2741 ms
strpbrk -> 1073741823 in 2359 ms
That's about 1.8GB/s for the first two, and around 423MB/s for strpbrk. However, when compiled with gcc-10 (with the command g++-10 -o ./a a.cpp -O3 -std=c++17) on Ubuntu 18.04 (same laptop - I'm using WSL), I get this:
std::string::find_first_of -> 1073741823 in 3341 ms
std::string_view::find_first_of -> 1073741823 in 3563 ms
std::find_first_of -> 1073741823 in 715 ms
strpbrk -> 1073741823 in 122 ms
That ranges from 300MB/s for the first two up to about 8.2GB/s for strpbrk...
honey the codewitch wrote:
Does anyone know if GCC will work on Windows without some virtual env like MiniGW installed?
MinGW is actually OK - Cygwin is the 'gcc on Windows' that introduces nastiness. As this site says, "MinGW is a port of GCC to Windows. ... It produces standalone Windows executables which may be distributed in any manner." I'd use the distro from that site, or maybe one from this site
Java, Basic, who cares - it's all a bunch of tree-hugging hippy cr*p
|
|
|
|
|
Naming things is hard, but really? strpbrk?
string pointer...? b? return? k? Really, what?
|
|
|
|
|
const char * strpbrk ( const char * str1, const char * str2 );
char * strpbrk ( char * str1, const char * str2 );
Locate characters in string
Returns a pointer to the first occurrence in str1 of any of the characters that are part of str2, or a null pointer if there are no matches.
The search does not include the terminating null-characters of either string, but ends there.
"string pointer break" seems closest. The person that named it was probably drunk at the time.
Real programmers use butterflies
|
|
|
|
|
honey the codewitch wrote: "string pointer break" seems closest Hah, amateur! I'd have gone for "spb"
|
|
|
|
|
Knowing the C stdlib it was probably already used for something.
Real programmers use butterflies
|
|
|
|
|
String Pointer BReaK. But you probably knew that. What you may not know is that this goes back to the dawn of Unix on a PDP with actual, real teletypes as I/O devices. Punching the keys on them was hard, so anything that could be abbreviated was. Thus cp, mv and ls rather than copy, move and list. Sure, only 2 chars each (abbrev, again!), but at the end of a day stabbing at the keys, it would make a difference ... if it only meant you could pick up that beer without wincing.
Keep Calm and Carry On
|
|
|
|
|
|
Sander Rossel wrote: I thought all that old stuff was abbreviated to save memory.
Sort of. I seem to recall that early linkers had only an 8 (or maybe 16) character limit for external identifiers, so that too played a part in the naming of system functions.
Keep Calm and Carry On
|
|
|
|
|
I am sure that you are right.
My next question is how much time your typical application spends inside strpbrk(). I can imagine that you can set up testbeds where it exceeds one percent of the total CPU load. But that is for a testbed.
Can you set up a true, user-level application solving a true user problem, where more than a single percent of the CPU time is spent inside strpbrk()? At a single percent, doubling the speed of strpbrk() might speed up the application by a whopping half percent. Woooah!
Sure: I see that thirty or seventy-five such optimizations together might be significant, taken as a whole. So go ahead with the twenty-nine, or seventy-four, other optimizations. Then serve the pudding.
The proof of the pudding is the pudding you serve to the end user.
|
|
|
|
|
I am using real-world data collected from an online repository at TMDB.com. I have 200kB of actual data from their repository, and then I synthesized 20MB of similar data in the same schema. I could have downloaded 20MB of JSON from TMDB.com; the only problem is that then I'm downloading 20MB of data from TMDB.com, and their rate limiting will hate me.
Now, for a real-world scenario where you're actually using TMDB's data, you'll likely end up mirroring their repository as you retrieve parts of it. For example, their repository contains every show and movie you'll find at IMDb.com, but in JSON format. If I only want shows from 2019, I can get just those, but the point is that the process is fetch-on-request, then cache. If you were to retrieve all the data, the entire repository would end up mirrored locally.
It is from this mirror that I'd want to extract data.
So yes, that's a real-world scenario. I even have a C# library that does this for TMDB.com already, but not using this JSON parser, which is in C++.
I've profiled it using the GNU profiler on Linux, but nothing else.
Most of the function time is in strpbrk(), at least for long scans.
More importantly, I know my actual throughput. I'm currently getting 2/3 of the throughput I got on a Linux machine, on a Windows machine whose hardware is maybe 10 times as fast or more.
And I know what function primarily impacts that throughput, because I've already profiled.
It's skipToAny(), which in the best case uses strpbrk() - it can't on Arduinos, but it will on Windows.
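A sketch of what that kind of skipToAny() split might look like - hypothetical names and signatures, since the real parser's function is only described above, not shown:

```cpp
#include <cstring>

// Fast path: for a null-terminated, in-memory buffer, lean on strpbrk()
// to jump straight to the next delimiter (or to end-of-input if none).
const char* skipToAny(const char* cur, const char* delims) {
    const char* p = std::strpbrk(cur, delims);
    return p ? p : cur + std::strlen(cur);  // no delimiter: end of input
}

// Portable fallback, one character at a time - the shape you'd use where
// strpbrk() isn't applicable (e.g. streamed input on an Arduino).
const char* skipToAnyPortable(const char* cur, const char* delims) {
    while (*cur && !std::strchr(delims, *cur))
        ++cur;
    return cur;
}
```

Both return a pointer to the first delimiter or to the terminator, so callers can treat "end of input" uniformly; the point is that only the fast path's inner loop inherits the libc's strpbrk() performance.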
Real programmers use butterflies
|
|
|
|
|
Want to use MS VC++ under windows with VS Code?
Good luck. Microsoft in their infinite wisdom
A) Set things up so you can't use MSVC without running a batch file first.
B) Made the batch file completely unreadable. I can't even tell where it sets PATH. How do you even do that?
C) Is just generally terrible.
D) Negates all the "Run VS Code here" shell extensions, which are now useless because you need to launch Code from the batch file's environment for it to work.
Tell me: Why in the world would *anyone* think it was a good idea to install MSVC++, not put it in the PATH, and then make it near impossible for you to do it yourself? Why?
Do they *want* me to move away from Windows for all my C++ development?
Real programmers use butterflies
|
|
|
|
|
Well, possibly they want you to move away from C++ for all your Windows development ...
"I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
"Common sense is so rare these days, it should be classified as a super power" - Random T-shirt
AntiTwitter: @DalekDave is now a follower!
|
|
|
|
|
Probably. I just opened a nastygram of an issue over at the VS Code C++ extension's github repo.
This is just unacceptable.
Real programmers use butterflies
|
|
|
|
|
Hmmm. All you need to do is execute vcvarsall.bat to set up your environment?
honey the codewitch wrote: Tell me: Why in the world would *anyone* think it was a good idea to install MSVC++, not put it in the PATH Because most C++ developers are using multiple tools and compilers (and multiple compiler versions). Keep in mind that Visual Studio allows you to compile with older versions of CL and ancient linkers.
|
|
|
|
|
Yes I know how to do that.
That is not the problem.
The problem is that it kills my workflow. I can no longer click on my project folder and go "Open with VS Code", because Microsoft stinks.
If Microsoft didn't stink (like that will ever happen), they would run vcvars from inside VS Code when you're using the C++ extension.
Adding: "people use ancient compilers" isn't really an acceptable justification. Workflow shouldn't be broken when using the standard compiler just because you might use an ancient one. You can overwrite environment variables, after all. Linux gets it. Microsoft is clueless.
Real programmers use butterflies
|
|
|
|
|
Well,
VS Code is open-sourced under the MIT license, so you are free to modify the behavior. Or you can open an issue to request a new feature. It sounds like a great feature to add to the VS Code C++ extension.
|
|
|
|
|
I've opened an issue already.
Real programmers use butterflies
|
|
|
|
|
honey the codewitch wrote: I've opened an issue already. Great.
In the old days I would always manually remove the build tools from the %PATH% environment variable.
I don't know how long you've been developing with C++ on Windows, but in the old days (the 90's to 2000's) the build tools were added to the %PATH% environment variable. That caused a lot of problems, because developers would install the Windows SDK, which had its own compiler and linker. Also, device driver developers would install the DDK, which yet again had its own compiler and linker. Then there were some guys (like me) that would have VC6, VS2005, VS2008, VS2010, VS2012.NET, and VS2013 all installed on the same workstation. I was so happy when VS2015 allowed me to compile with older versions. It meant that I didn't have to install 5 different versions of Visual Studio.
Best Wishes,
-David Delaune
|
|
|
|
|
Randor wrote: which had its own compiler and linker
I think I see the problem right there. I wonder why Microsoft didn't?
Randor wrote: Then there were some guys (like me) that would have VC6,VS2005,VS2008,VS2010,VS2012.NET,VS2013 all installed on the same workstation.
Sane thing (meaning not in the cards for MS): set the env vars to the latest compiler, and allow the user to run a batch file to set the env vars for the older compilers. Better yet, make the latest compiler support older compilation**
** which would have been easier if microsoft hadn't spent years ignoring the C++ standard
Insane thing: Make everyone's life harder by not having sane defaults, and by using crap compilers for years before finally deciding that standards matter.
Real programmers use butterflies
|
|
|
|
|