Tags
calendar dimension, Data warehouse, date and time dimension, datetime dimension, Microsoft SQL Server, TSQL
Calendar Tables – Why You Need One.
<credit: This section was copied from Made2Mentor>
What is a Date (Calendar) Table?
For the purposes of this article a Date Table is a dedicated table containing a single record for each day in a defined range. They include fields with descriptive attributes for each day such as Year, Month, Week, and whether a particular day is a work day or not. Date Tables (Dimensions) are an integral part of every Data Warehouse, but they can be used with a regular (OLTP) database as well.
Why do I need a Date Table?
There are several situations in which a date table is extremely helpful. Let’s consider just a few. Suppose you need to….
- Generate a running sales total per day and include days in which you have no sales such as weekends.
- Calculate the number of workdays between two dates.
- Calculate projected ship dates for products based upon lead times.
- Aggregate data by non-standard Fiscal Years or Periods. Perhaps your company uses the Fiscal Year of the US Government which runs from October 1st until September 30th.
- Need to track data by seasons, whatever that means to your company. For example, if you manufactured bicycles you’d certainly want to compare sales figures in the spring versus fall.
- Need to compare data using rolling months/periods. Suppose your company wants to know how March sales for this year compared to March for the past 5 years. Or you want to compare sales which occurred on Fridays to each other to look for trends.
- Need to create Gantt Charts.
While many of these issues can be handled by writing T-SQL code, they’re much easier with a dedicated date table. Also, the Date Table can be customized to track anything about a particular day you want.
A script to create a calendar table
I was recently faced with the challenge of creating a calendar dimension with hourly grain for reporting on a Microsoft SQL Server data warehouse application
There are two parts to the solution. A script to create and populate a table and a function used to generate integer keys from dates.
The script.
/* Title: Script to create a dateTime dimension with hourly granularity. Author: James Bayley james@elephantpm.com http://www.elephantpm.com published at http://blog.jamesbayley.com based on a script published by Joost van Rossum http://microsoft-ssis.blogspot.co.uk/2011/01/create-and-fill-time-dimension.html Purpose: This script creates and populates a dateTime dimension (calendar dimension) with hourly grain. The key is an integer of the form YYYYMMDDHH For each dateTime in your fact table you must introduce a similar integer key. For example if you have a "SaleDateTime" you must create "iSaleDateTime" which represents this dateTime as an integer of the form YYYYMMDDHH. You use this table in a left out join with your fact tables to ensure that you get a record for each hour you want to report. You can of course aggregate at higher levels. Comments: This script is easily extended to include other columns such as financial year and local times or you may increase the granularity to minutes or seconds by adding additional loops Copyright: (C) Please include the author block above. Licence Creative Common Attribution 2.0 http://creativecommons.org/licenses/by/2.0/uk/ */ drop table dbo.ElephantPM_Dim_DateTime go -- Create your dimension table -- Adjust to your own needs Create Table dbo.ElephantPM_Dim_DateTime ( iDateTime int PRIMARY KEY CLUSTERED, DateTime dateTime, DateString varchar(10), HourOfDay int, Date date, Day int, DayofYear int, DayofWeek int, DayofWeekName varchar(10), Week int, Month int, MonthName varchar(10), Quarter int, Year int, IsWeekend bit, IsLeapYear bit ) -- Declare and set variables for loop Declare @StartDate datetime, @EndDate datetime, @DateTime datetime, @iHour int -- Set the dates to match your data Set @StartDate = '2004/01/01' Set @EndDate = '2018/12/31' Set @DateTime = @StartDate -- Loop through dates WHILE @DateTime <=@EndDate BEGIN -- Check for leap year DECLARE @IsLeapYear BIT IF ((Year(@DateTime) % 4 = 0) AND (Year(@DateTime) % 100 != 0 OR Year(@DateTime) % 400 = 0)) BEGIN SELECT @IsLeapYear = 1 END ELSE BEGIN SELECT @IsLeapYear = 0 END -- Check for weekend DECLARE @IsWeekend BIT IF (DATEPART(dw, @DateTime) = 1 OR DATEPART(dw, @DateTime) = 7) BEGIN SELECT @IsWeekend = 1 END ELSE BEGIN SELECT @IsWeekend = 0 END set @iHour = 0 While @iHour <=23 begin -- Insert record in dimension table INSERT Into ElephantPM_Dim_DateTime ( [iDateTime], [DateTime], [Date], [DateString], [HourOfDay], [Day], [DayofYear], [DayofWeek], [Dayofweekname], [Week], [Month], [MonthName], [Quarter], [Year], [IsWeekend], [IsLeapYear] ) Values ( [dbo].[IntKeyForYYMMDDHH](@DateTime), @DateTime, convert(date,@DateTime), CONVERT(varchar(10), @DateTime, 6), -- See help for 6 explanation @iHour, Day(@DateTime), DATEPART(dy, @DateTime), DATEPART(dw, @DateTime), DATENAME(dw, @DateTime), DATEPART(wk, @DateTime), DATEPART(mm, @DateTime), DATENAME(mm, @DateTime), DATENAME(qq, @DateTime), Year(@DateTime), @IsWeekend, @IsLeapYear ) select @DateTime = dateadd(hour,1,@DateTime) set @iHour = @iHour +1 end END GO
The function.
/****** Object: UserDefinedFunction [dbo].[IntKeyForYYMMDDHH] Script Date: 01/04/2013 14:37:18 ******/ IF EXISTS (SELECT * FROM sys.objects WHERE object_id = OBJECT_ID(N'[dbo].[IntKeyForYYMMDDHH]') AND type in (N'FN', N'IF', N'TF', N'FS', N'FT')) DROP FUNCTION [dbo].[IntKeyForYYMMDDHH] GO USE [fogbugz-development-copy] GO ET QUOTED_IDENTIFIER ON GO -- ============================================= -- --Author: --James Bayley james@elephantpm.com -- http://www.elephantpm.com --published at http://blog.jamesbayley.com --Copyright: (C) --Please include the author block above. --Licence Creative Common Attribution 2.0 --http://creativecommons.org/licenses/by/2.0/uk/ -- ============================================= CREATE FUNCTION [dbo].[IntKeyForYYMMDDHH] ( -- Add the parameters for the function here @dateTime dateTime ) RETURNS int AS BEGIN -- Declare the return variable here DECLARE @sYYYYMMDD nchar(8) declare @iHour int declare @sHour nchar(2) declare @sYYYYMMDDHH nchar(10) declare @iReturn int -- Add the T-SQL statements to compute the return value here select @sYYYYMMDD = convert(nchar,@dateTime,112) select @iHour = datepart(hour, @dateTime) if (@iHour < 10) Begin select @sHour = '0' + convert(nchar,@iHour) End else Begin select @sHour = convert(nchar,@iHour) End select @sYYYYMMDDHH = @sYYYYMMDD + @sHour select @iReturn = convert(int,@sYYYYMMDDHH) -- Return the result of the function RETURN @iReturn END GO EXEC sys.sp_addextendedproperty @name=N'Description', @value=N'This function is used to generate the integer key used to join fact tables to the dateTime dimension with hourly granularity.' , @level0type=N'SCHEMA',@level0name=N'dbo', @level1type=N'FUNCTION',@level1name=N'IntKeyForYYMMDDHH' GO
(also known as date and time dimensions, date dimensions, dateTime dimension)
updated: 2017-01-03, added introductory section