vBulletin Search Engine Optimization
| |||||||
| Register | FAQ | Members List | Calendar | Search | Today's Posts | Mark Forums Read |
| ||||
| Hi all, I'm working on my first data warehouse and I'm not sure how I should name the columns in the database. The first phase of the data warehouse is to store a bunch of data from one third party source. The source contains over 100 pieces of data and the business user doesn't even know what some of the fields are but he wants to store everything. The third party refers to the each field with a somewhat cryptic short name and a longer description. The short name isn't always cryptic. My question is am I better off naming my columns the same as the source system's short name so that I can easily debug problems later? Should I instead try to shorten their definition into something meaningful? On a side note, I'm 100% positive that we'll never populate the tables in questions with data from an additional source. Thanks! |
| |||
| This is just my opinion (before everybody flames me) but I wouldn't use the short names. In a data warehouse and as a general practice, it is better to be descriptive so that people in the future will understand the design of the system. If possible for your system you might want to try to mimic the names of the fields that are in the report application. This way DBA and techies will understand them and also regular business users, if there's such a need -- Sincerely, John K Knowledgy Consulting http://knowledgy.org/ Atlanta's Business Intelligence and Data Warehouse Experts "Jim" <jim.gaull@gmail.com> wrote in message news:649ac820-136e-4a04-a74c-2edec37b4bcd@u69g2000hse.googlegroups.com... > Hi all, > > I'm working on my first data warehouse and I'm not sure how I should > name the columns in the database. > > The first phase of the data warehouse is to store a bunch of data from > one third party source. The source contains over 100 pieces of data > and the business user doesn't even know what some of the fields are > but he wants to store everything. The third party refers to the each > field with a somewhat cryptic short name and a longer description. > The short name isn't always cryptic. > > My question is am I better off naming my columns the same as the > source system's short name so that I can easily debug problems later? > Should I instead try to shorten their definition into something > meaningful? On a side note, I'm 100% positive that we'll never > populate the tables in questions with data from an additional source. > > Thanks! |
| ||||
| "Jim" <jim.gaull@gmail.com> wrote in message news:649ac820-136e-4a04-a74c-2edec37b4bcd@u69g2000hse.googlegroups.com... > Hi all, > > I'm working on my first data warehouse and I'm not sure how I should > name the columns in the database. <snip/> > My question is am I better off naming my columns the same as the > source system's short name so that I can easily debug problems later? > Should I instead try to shorten their definition into something > meaningful? On a side note, I'm 100% positive that we'll never > populate the tables in questions with data from an additional source. I concur with John K. Use descriptive names, especially when looking down the road if you're planning to build data marts from your data warehouse. A lot of client applications expose the column names to business users, so it's a good idea to keep the column names in the data marts descriptive so that business users will not be confused. By the same token, since many client tools expose the column names to users, try not to make the names too long. I've run into problems with nondescriptive column names and too-long column names with tools like ProClarity on the front end. From a technical perspective it makes administration and troubleshooting easier if your column names are somewhat human-readable. |