I'm taking a Coursera course about time complexity. I encountered this in a video and I think something is wrong. "So suppose that we have an algorithm whose runtime is roughly proportional to n and we want to run it on a machine that runs at about a gigahertz. How large an input can we handle such that we'll finish the computation in a second?" (1 GHz CPU = 10^9 cycles per second)

"Well if it runs at about size n, you can handle about a billion sized inputs, before it takes more than a second" (10^9 / 10^9)

"If it runs like n squared, it's a lot worse. You can only handle inputs of size about 30,000 before it starts taking more than a second." n = 30000, (30000)^2/10^9 = 0.9 sec ~ 1 sec

"If the inputs are of size 2 to the n, it's incredibly bad, you can only handle inputs of size about 30 in a second" 2^30/10^9 ~ 1.07 sec

With n log(n) complexity, if the input size n is 10^9, then it should take 10^9 * log(10^9) / 10^9 = 9 seconds, right? Somehow the video shows 30 seconds. It also says that if n = 10^7.5 then the runtime is 1 second, which also disagrees with my calculation.

I calculated every case with the n, n^2 and 2^n complexities and everything was right; only the n log(n) case was off.

Here is the photo for more details: https://i.upanh.org/2022/04/11/nlogn.jpg[^]

Linked the course video (3:40 to 5:00 is the part): https://www.coursera.org/lecture/algorithmic-toolbox/asymptotic-notation-zI8dH[^]

I edited the question so that it matches the course video I watched.

What I have tried:

I calculated every case with the n, n^2 and 2^n complexities and everything was right; only the n log(n) case was off.
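The arithmetic in the question can be checked with a short script (Python here just for illustration), assuming the video's idealized machine that performs 10^9 basic operations per second:

```python
import math

# Idealized model from the video: the machine performs 10**9 basic
# operations per second, so runtime = operations / 10**9 seconds.
OPS_PER_SEC = 10**9

def seconds(ops):
    return ops / OPS_PER_SEC

n = 10**9
print(seconds(n))                    # O(n), n = 10**9: 1.0 s
print(seconds(30000**2))             # O(n^2), n = 30,000: 0.9 s
print(seconds(2**30))                # O(2^n), n = 30: ~1.07 s
print(seconds(n * math.log10(n)))    # n * log10(n): 9.0 s (my calculation)
print(seconds(n * math.log2(n)))     # n * log2(n): ~29.9 s (the video's 30 s)
```

The last two lines show where the 9-vs-30 discrepancy comes from: the only difference is the base of the logarithm.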
Posted
Updated 11-Apr-22 12:30pm
v4

Quote:
If the input size n is 10^9, then it should take 10^9 * log(10^9) / 10^9 = 9 second, right?

No! You don't calculate runtime from Big O notation, at least not by itself.
The Big O notation tells you the complexity class of an algorithm, not the runtime, at least not directly.
When you have a measured runtime for a given data size, the Big O complexity lets you estimate the runtime for another data size.
A timing of 1 second for data sizes from 20 to 10^6 means that you need to switch to millisecond timings to get meaningful measurements.
 
Share this answer
 
Comments
Hoang Minh Quang FX15045 11-Apr-22 10:36am    
Sorry, I just watched the video again. There is this line "So suppose that we have an algorithm whose runtime is roughly proportional to n and we want it to run it on a machine that runs at about a gigahertz. How large an input can we handle such that we'll finish the computation in a second? " I also updated the question with some lines in the course video.
The big O notation doesn't give the exact time of execution. See, for instance, Big O notation - Wikipedia[^] (the Properties section, multiplication by a constant).
 
Share this answer
 
To add to what the others say, you can't equate clock cycles directly with execution time anyway: some machine instructions take one cycle to execute, others take many. On top of that, modern processors are pipelined, so some instructions may start while others are still executing; they are cached, so execution time may depend on how long ago an instruction was last executed and whether its data is available in the L1, L2 or L3 cache; and it also depends on the availability of free cores, plus a host of other details.

You can't just say "my clock speed is X, so my execution time is X * Y" - there are far too many other factors that will influence it.
 
Share this answer
 
30 seconds is about right if the log is base 2: log2(10^9) ≈ 30, since log2(1000) ≈ 10.

EDIT: O(n log(n)) algorithms often recursively divide the problem in half, solve each half separately, and then combine the solutions. This is why log2 is usually implied. Mergesort is a good example.
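Under the base-2 reading, both of the video's claims can be checked numerically. This sketch bisects for the largest n whose n * log2(n) operations fit in one second at 10^9 operations per second (whether you call the cutoff "about 10^7.5" is a matter of rounding):

```python
import math

# n * log2(n) operations at 10**9 ops/sec. For n = 10**9 this is about
# 30 seconds; bisect for the largest n that still fits in one second.
def ops(n):
    return n * math.log2(n)

lo, hi = 2, 10**9
while hi - lo > 1:
    mid = (lo + hi) // 2
    if ops(mid) <= 10**9:
        lo = mid
    else:
        hi = mid

print(ops(10**9) / 10**9)   # ~29.9 seconds for n = 10**9
print(lo)                   # ~3.9 * 10**7, i.e. roughly 10**7.6
```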
 
Share this answer
 
Comments
Hoang Minh Quang FX15045 11-Apr-22 14:22pm    
OMG thank you!! I just searched on Google: in programming, log(a) usually means log2(a), not log10(a) like I learned in high school.
The big O notation only tells you how the execution time will change when the "size" of the problem changes; it does not provide an actual estimate for a specific case.

In other words, there is an unknown factor, so for example O(n^2) could mean

time_needed = constant * n * n

where nobody tells you the value of the constant, hence no concrete value for time_needed.

The example only tells you that a problem twice as large will take four times as long to solve.
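The "unknown constant" point can be illustrated by counting basic operations instead of timing them; the constant cancels out when you compare two sizes, and that ratio is the only thing Big O actually predicts. A minimal sketch:

```python
# Count "basic operations" for a quadratic algorithm at two input sizes.
# The unknown constant factor cancels in the ratio.
def count_pairs(n):
    ops = 0
    for i in range(n):
        for j in range(n):
            ops += 1    # one basic operation per (i, j) pair
    return ops

a = count_pairs(500)
b = count_pairs(1000)   # input twice as large
print(b / a)            # 4.0: twice the input, four times the work
```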
 
Share this answer
 
Big O notation talks about generalizations.
It's similar to doing size estimates in, let's say, economics: 1,000 and 750 would both be rounded to 1,000, and what matters is that 10K is not in the same ballpark as 100K, while 13K is in the ballpark of 10K and 20K (the same range).

If you have $20 and you want to buy a dish that you know should cost somewhere in the $4 range, then as long as the price isn't growing like 2N or N^2 you don't care: $5 or $6 is OK, $7 is too high, and $8 (a level of 2N) is not acceptable.

In CS we want to know what the complexity of a problem will be.

If you have N = 3, then N, 2N or 3N give 3, 6 and 9 respectively, which doesn't seem like a big deal. Moreover, N^2 would give 9 as well, so why is it even an issue?

Let's change N to 15M (say 15 million tweets a day, i.e. 15,000,000).
Now 15M, 30M and 90M are all in the millions, so if it takes 1 millisecond (ms) to load a tweet, it would take 15x10^6 ms = 15x10^3 seconds = 15,000 seconds, a bit over 4 hours, to read 15 million records on one thread.
2N, 3N... that all still fits in a day; it's terrible, but you can live with it if need be.
N^2 jumps to no feasible implementation: (15 million)^2 is such a big number that your great-grandson might not live to see that computation end (I might be exaggerating; I didn't check).

So for huge datasets this is very, very important.
For smaller datasets it's just messy, and sometimes moving from 2N to N can make the code less readable and maintenance sketchy; other factors go into this too.

Moving from 12 ms to 10.2 ms on a client machine retrieving type suggestions, for instance, is a waste of time.
If you have 1,000 contacts and you don't keep them sorted, then when you pull them up to read them you need to pass over each one, O(N), and sort them; only then can you find the median, the average, or whatever else you need.

If you keep them sorted in a binary tree, for example, a lookup only needs log2(N) = log2(1000), about 10 iterations, instead of touching all 1,000 entries.

If this were 15 million contacts, you would need to take big O into much more consideration.
Hope this helps someone.
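As an illustration of the sorted-contacts point, here is a sketch using a sorted Python list with binary search (the standard `bisect` module) standing in for the binary tree; the contact names are made up for the example:

```python
import bisect

# 1,000 made-up contact names, kept sorted; a lookup then needs about
# log2(1000) ~ 10 comparisons instead of scanning all 1,000 entries.
contacts = sorted(f"contact{i:04d}" for i in range(1000))

def has_contact(name):
    i = bisect.bisect_left(contacts, name)
    return i < len(contacts) and contacts[i] == name

print(has_contact("contact0042"))   # True
print(has_contact("nobody"))        # False
```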
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


