C# Discussion Boards - CodeProject

Re: Which Constructor is Better?

PIEBALDconsult

16-Apr-13 15:01

I'd probably use a const, but have you considered using an enumeration?

Jasmine2501 wrote:
wouldn't that take a memory location we don't need to use?

Possibly, but that kind of thinking can lead to defines. At one place I worked (in C), the standard was to put such values in defines to "save space". :rolleyes:

Re: Which Constructor is Better?

Jasmine2501

17-Apr-13 5:36

But that doesn't really save space, right? Using #define puts the literal value into your final code, right? So, if it's used in multiple places, you're actually wasting memory (code size) with #define, but only if the value is used more than once.

Re: Which Constructor is Better?

Matt T Heffron

17-Apr-13 9:53

The const for the string is really the better way.
Remember that in C#, strings are immutable, so the compiler can and will automatically use the same actual string no matter how often it is referenced. Referencing a string const multiple times is the same as using the identical string literal multiple times: only one string is stored in the program, and every reference to it points to the exact same object.

So why did I say the const for string is better?
For maintainability. The name of the const can (should) be based on the functional use of the value, and can be updated in a single location, guaranteed to affect all uses. With string literals it is easy to miss one. :)

Re: Which Constructor is Better?

Jasmine2501

17-Apr-13 9:57

Yeah I agree, I just thought since the string was being created as an object, it's going to get stored in memory somewhere, in addition to the place where it's stored in the code.

Re: Which Constructor is Better?

Matt T Heffron

17-Apr-13 10:02

But no matter how you code it, const or literal, it must be an object at run-time!
The compiler arranges for it to always be the same object.
This is from the C# spec document:
For instance, the output produced by

class Test
{
  static void Main() {
    object a = "hello";
    object b = "hello";
    System.Console.WriteLine(a == b);
  }
}

is True because the two literals refer to the same string instance.

Re: Which Constructor is Better?

Jasmine2501

17-Apr-13 10:04

Well no, if you stick it in there as a literal, it's only stored once, in the code.

Unless you're saying...

Console.WriteLine("Hello World");

...creates a string object in memory?

Your IF is true above because you used the equivalence operator, which compares the values, not the pointers.

Re: Which Constructor is Better?

Matt T Heffron

17-Apr-13 10:07

Yes. Exactly.
The receiving method must get a string object. That is all it knows how to deal with!

See: http://msdn.microsoft.com/en-us/library/system.string.intern.aspx

modified 17-Apr-13 16:19pm.

Re: Which Constructor is Better?

Matt T Heffron

17-Apr-13 13:35

Regarding == on the strings: the same result (True) is displayed if the comparison is changed to object.ReferenceEquals(a, b).
They really are the same object.
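To make the reference-versus-value point concrete, here is a minimal sketch rather than anything from the spec. The names InternDemo, Greeting, and Same are made up for illustration. It compares a literal with a const, with a copy built at run time, and with that copy passed through string.Intern.

```csharp
using System;

static class InternDemo
{
    // Hypothetical const, standing in for the named constant discussed above.
    const string Greeting = "hello";

    // True only when both references point at the same object.
    public static bool Same(object x, object y) => object.ReferenceEquals(x, y);

    public static void Main()
    {
        object a = "hello";
        object b = Greeting;
        // Built from a char array at run time, so it is a separate object with equal contents.
        object c = new string("hello".ToCharArray());

        Console.WriteLine(Same(a, b));    // True: literal and const share one instance
        Console.WriteLine(Same(a, c));    // False: the run-time copy is a different object
        Console.WriteLine(a.Equals(c));   // True: Equals compares the characters
        // Interning the copy looks it up in the intern pool, where the literal already lives.
        Console.WriteLine(Same(a, string.Intern((string)c)));
    }
}
```

The second line is the case Jasmine had in mind: two strings with equal contents are separate objects unless they both come from literals or are interned explicitly.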

Re: Which Constructor is Better?

V.

16-Apr-13 20:23

It's probably personal. I prefer option 2, because it does not allow for confusion and keys rarely or never change name.

V.

Re: Which Constructor is Better?

Orjan Westin

16-Apr-13 22:45

There are a number of issues here if you want a general answer rather than one strictly limited to the examples given. One potentially important issue has not been mentioned so far: the assignment will happen at different times.

In your first example, SettingValue is assigned after all base constructors have been executed. In your second example, which uses field initialisation, SettingValue is assigned before any base constructors have been executed.

This may affect what exceptions are thrown on construction errors. For instance, a ConfigurationErrorsException will be thrown if the value is not found. In the first example, you can catch this in the constructor and set a default, or give a more detailed exception. In the second example, you can't, and will have to rely on it being caught outside the class.

And if a base constructor would also throw an exception, in your first example this is what will be thrown, while in the second example it's the ConfigurationErrorsException that will be thrown.
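The ordering described above can be observed with a small sketch. All type names here (Log, BaseThing, AssignInConstructor, AssignInInitializer) are invented for the demonstration and are not from the original question. Each step appends a label to a shared list so the sequence can be printed.

```csharp
using System;
using System.Collections.Generic;

static class Log
{
    public static readonly List<string> Steps = new List<string>();

    // Records a step and returns the value, so it can be called from an initializer.
    public static string Note(string step, string value)
    {
        Steps.Add(step);
        return value;
    }
}

class BaseThing
{
    public BaseThing() { Log.Steps.Add("base constructor"); }
}

// Option 1: the field is assigned in the constructor body.
class AssignInConstructor : BaseThing
{
    private string settingValue;

    public AssignInConstructor()
    {
        settingValue = Log.Note("constructor body assignment", "x");
    }
}

// Option 2: the field is assigned by a field initializer.
class AssignInInitializer : BaseThing
{
    private string settingValue = Log.Note("field initializer", "x");
}

static class OrderDemo
{
    public static void Main()
    {
        new AssignInConstructor();
        new AssignInInitializer();
        // base constructor -> constructor body assignment -> field initializer -> base constructor
        Console.WriteLine(string.Join(" -> ", Log.Steps));
    }
}
```

With the constructor-body version, a try/catch around the assignment is possible inside the class. A field initializer offers no such place.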

Re: Which Constructor is Better?

Richard Deeming

17-Apr-13 0:39

Orjan Westin wrote:
a ConfigurationErrorsException will be thrown if the value is not found

Not in the AppSettings section; if the specified key doesn't exist, it will return null.

However, you can get a ConfigurationErrorsException if the configuration file is corrupt.
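A minimal sketch of handling the null case follows. ConfigurationManager.AppSettings is a NameValueCollection, so to keep the example self-contained it uses a plain NameValueCollection as a stand-in. The helper name GetOrDefault and the keys are assumptions for illustration only.

```csharp
using System;
using System.Collections.Specialized;

static class SettingsDemo
{
    // Returns the stored value, or the fallback when the key is absent
    // (the indexer yields null for a missing key instead of throwing).
    public static string GetOrDefault(NameValueCollection settings, string key, string fallback)
    {
        return settings[key] ?? fallback;
    }

    public static void Main()
    {
        // Stand-in for ConfigurationManager.AppSettings.
        var settings = new NameValueCollection { { "Present", "42" } };

        Console.WriteLine(GetOrDefault(settings, "Present", "default")); // 42
        Console.WriteLine(GetOrDefault(settings, "Missing", "default")); // default
    }
}
```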

"These people looked deep within my soul and assigned me a number based on the order in which I joined."
- Homer

Re: Which Constructor is Better?

Orjan Westin

17-Apr-13 1:58

Oh, I wasn't aware of that. Thanks.

Re: Which Constructor is Better?

Jasmine2501

17-Apr-13 5:39

Thanks! This is what I'm looking for. I didn't think about the order of things, that might be important. Also thanks for giving me the names of the processes, I had never heard "field initialization" before.

Re: Which Constructor is Better?

jschell

17-Apr-13 9:18

Option 1 because if it fails then the cause of the failure is less likely to be confusing.

And yes the code can fail.

Re: Which Constructor is Better?

Jasmine2501

17-Apr-13 9:28

Any code can fail, but if this fails, it's a fatal error and if the application blows up, that's fine.

Loading Very Large DataSet Without losing any information

losan

16-Apr-13 7:23

Hi All;

I have a large dataset stored in a CSV file (about 40,000 rows and 10,000 columns). I need to load it into a C# Windows application. Any ideas on how to do this? I tried different code, but some versions could load only 40,000 rows by 255 columns, and others only 5,000 rows by 10,000 columns.

Thanks

losan1985

Re: Loading Very Large DataSet Without losing any information

Dave Kreskowiak

16-Apr-13 7:58

It's probably going to take rolling your own custom class to hold it all. I don't know of anything "off-the-shelf" that will hold 10,000 columns. Frankly, I've never even HEARD of such a wide CSV file ever being used.

It shouldn't be very hard at all to create a List<List<int>> or whatever your item data type is. Basically, a List of List of Integers.

A guide to posting questions on CodeProject

Dave Kreskowiak

Re: Loading Very Large DataSet Without losing any information

SledgeHammer01

16-Apr-13 8:05

You probably can't load it all at once (at least in a 32-bit OS). 40,000 x 10,000 = 400,000,000 bytes if each cell is 1 byte. If you assume an average of 16 bytes per cell (since you didn't say), that's 6,400,000,000 bytes, or about 6 GB of data. You only have 2 GB of address space for your application. You can do it on a 64-bit OS, though.
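The estimate is easy to check in code. This is only a back-of-the-envelope sketch: the 16 bytes per cell is the assumed average from the post, and the names SizeEstimate and Bytes are invented here. Note the use of long, since the product overflows a 32-bit int.

```csharp
using System;

static class SizeEstimate
{
    // long arithmetic: 40,000 * 10,000 * 16 does not fit in a 32-bit int.
    public static long Bytes(long rows, long columns, long bytesPerCell)
    {
        return rows * columns * bytesPerCell;
    }

    public static void Main()
    {
        long total = Bytes(40000, 10000, 16);
        Console.WriteLine(total);                          // 6400000000
        Console.WriteLine(total / (1024.0 * 1024 * 1024)); // roughly 5.96 (binary gigabytes)
    }
}
```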

With that being said, I doubt you really need 40,000 x 10,000 cells loaded in memory at once. What is a person going to do with all that data?

You might want to consider loading only the portion you need.

Re: Loading Very Large DataSet Without losing any information

Pete O'Hanlon

16-Apr-13 9:33

What is your actual requirement? You are loading this data for a reason. What is that reason? For instance, are you performing some calculation on certain columns? By breaking down your requirements, we can work out a practical solution.

I was brought up to respect my elders. I don't respect many people nowadays.

Re: Loading Very Large DataSet Without losing any information

Rockstar_

16-Apr-13 18:24

yes, this may solve your problem....

Re: Loading Very Large DataSet Without losing any information

Mycroft Holmes

16-Apr-13 17:23

As POH has said, your design has to be wrong for this to be a valid requirement. Go back and look at how the CSV was created: why does it require 10k columns (what a ridiculous number)? Can your source break it up into more swallowable chunks? Do you need all 10k columns?

Can you load and process 1 row at a time? Presumably you want to dump this into some more reasonable format.
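A rough sketch of the row-at-a-time idea, assuming every cell is a plain integer and the file has no quoted fields or header row (the original post does not say). RowStreaming and ReadRows are hypothetical names. File.ReadLines reads lazily, so only the current line and its parsed row are held in memory.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;
using System.IO;
using System.Linq;

static class RowStreaming
{
    // Lazily yields one parsed row at a time instead of loading the whole file.
    public static IEnumerable<int[]> ReadRows(string path)
    {
        foreach (string line in File.ReadLines(path))
        {
            if (line.Length == 0) continue; // skip blank lines
            yield return line.Split(',')
                             .Select(cell => int.Parse(cell, CultureInfo.InvariantCulture))
                             .ToArray();
        }
    }

    public static void Main(string[] args)
    {
        long rowCount = 0;
        long grandTotal = 0;
        foreach (int[] row in ReadRows(args[0]))
        {
            rowCount++;
            foreach (int cell in row) grandTotal += cell; // per-row processing goes here
        }
        Console.WriteLine(rowCount + " rows, total " + grandTotal);
    }
}
```

Each row can then be aggregated or written out in a more compact format before the next one is read.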

Never underestimate the power of human stupidity
RAH

Re: Loading Very Large DataSet Without losing any information

V.

16-Apr-13 22:09

You'll need to build in a sort of paging mechanism that loads only the part that is shown on the screen.

V.

Re: Loading Very Large DataSet Without losing any information

BobJanova

17-Apr-13 4:23

As SledgeHammer01 says, that's an unreasonably large amount of data for most purposes. It's 400 million cells, so you're talking about gigabytes of memory, depending on exactly what's in there. What do you want to do with this dataset? You almost certainly want a load-on-demand adapter of some kind, so you can run through the data without actually having it all in memory at once.

This library is rather good; I used it in a real application (though not dealing with massive datasets) without problem.

Re: Loading Very Large DataSet Without losing any information

Alan Balkany

18-Apr-13 4:55

Hi losan,

I found your post very interesting because I've never encountered a data set that large. Are you trying to analyze that data? If so, I may be able to help.

I have a product (www.patternscope.com) that finds patterns in extremely large data sets. I think your data set would be good for stress-testing the application, and it fits perfectly with two planned developments:

1. Reading CSV data (currently it only reads databases through ODBC, or flat files), and
2. Making a C#-callable API that you could use in your C# application to handle that much data (e.g. queries, retrieval, and analysis).

My product extracts the patterns that comprise the raw data. These patterns are a fraction of the size of the original raw data, so they fit entirely into memory, even when the raw data is larger than the memory available.

The patterns have the same information content as the raw data, so can be processed (e.g. queries or analysis) many times faster.

If you could give me a copy of your data set, I could give you a free copy of PatternScope (after I adapt it for reading CSV data) which you could use to analyze the data, followed by a DLL you could call from C# for processing the data in your program.

What does this data represent?

Re: Loading Very Large DataSet Without losing any information

losan

20-Apr-13 7:21

Hi;

Here is a link for the Dataset
"www.dropbox.com/s/een9zlqce4vqqrl/ProjectData3.csv"

What I need to do is apply collaborative filtering algorithms to the dataset. The dataset is about tweets: who is going to retweet another person.

Thanks

losan1985
