Click here to Skip to main content
15,900,725 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more: , +
Hello,

I'm building a web tool for bioinformatics field. I have to deal with DNA sequence files which will exceed 2 GB for each file and I'll have dozens of text files. I will have a DB that keep these DNAs for me. My question is will it be more efficient to read the txt file and store its content in a table record or store the path of the txt file and read it every time I need to access it ?? or is there another way ?
I'm using SQL server 2008, VS 2010 asp.net and c#
Posted

1 solution

Probably storing the path of of the file would be better, but since you haven't explained anything about how you're going to access and use the data, it's pretty hard to tell.

For example, are your clients just going to read the entire file and use every bit of or are you expected to index all of this data and search for random parts of it??

There's a lot more information require than just "what's the best way to store this"...
 
Share this answer
 
Comments
AseelHadlaq 23-Feb-13 14:55pm    
Thanks Dave,

The user will enter a motif and the system will search for that motif in one or more of the text files as the user request. I have a search algorithm that I want to apply.
Dave Kreskowiak 23-Feb-13 19:48pm    
Soooo...how are you going to search for this "motif", whatever that is?? Since you already have a search algorith you want to use, that is kind of going to dictat how you store this data.
AseelHadlaq 24-Feb-13 13:51pm    
a motif is like "ACCCGTA" a part of the DNA that I want to search for its occurrences in the whole DNA file.
I will access a file or many files as many times as the user wants to search for that motif in different DNA files.
The search algorithm does not depend on the file size, but my concern would be is it faster to store the data in a record or just store the path and each time a user request a certain file I go and read it.

Thanks a lot
Dave Kreskowiak 24-Feb-13 14:57pm    
Since you're searching these files, your problem isn't storing the file, but indexing the data. How long do you think it takes to read an entire 2GB file and search of a substring?
[no name] 23-Feb-13 15:11pm    
My 5! Dave. It's good advice to keep path than complete data of file.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900