SSTS Blog
Some news and tidbits that we share
Learned something new recently while having the task of manipulating large files from a data provider. You see, I needed to gather several years of financial information for several thousand companies (at the daily level). There were about 30 attributes, so this process needed to be run about 30 times, producing 30 resulting files. After it was done, it all needed to be loaded into a database. I could have loaded the files individually, but in the end all the data needed to be joined. I actually tried loading it all into the database and letting the database do the join, however these were such large data sets that the memory required for such a join was more than I had. I had a "wouldn't it be great" moment, wondering if there was a way to join the files together in a streaming fashion. The order of the lines was such that line 1 of each file could be joined together in a consistent way (they all belonged to the same security and the same date).
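Before merging anything line-by-line it's worth double checking that assumption. A quick sanity check (assuming all the files share a .csv extension) is to compare line counts:

wc -l *.csv

Every file should report the same count; if one is off, the rows won't line up and the merge would silently mis-join.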
I was able to run the following command after placing all the CSV files in a directory:
paste -d ',' *.csv > all_data.csv
After that I had one monster file ready to load into the database - pre-joined and all!
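To see the mechanics with a tiny made-up example (hypothetical file names and values, not the real provider data), imagine two attribute files whose lines are in the same security/date order:

$ cat price.csv
AAPL,2007-12-03,180.22
MSFT,2007-12-03,33.55
$ cat volume.csv
AAPL,2007-12-03,39339800
MSFT,2007-12-03,51405300
$ paste -d ',' price.csv volume.csv
AAPL,2007-12-03,180.22,AAPL,2007-12-03,39339800
MSFT,2007-12-03,33.55,MSFT,2007-12-03,51405300

paste simply glues line N of every input together with the delimiter, so the security and date columns repeat once per input file; cut can trim the duplicates afterwards if the loader minds. One caution: on a re-run, all_data.csv itself will match the *.csv glob, so delete it first or write the output to another directory.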
Read more http://billennis-ssts.blogspot.com/2007/12/paste.html