Up to this point in our design patterns sessions we’ve concentrated on removing duplicates from our troubled table. Now, let’s look into preventing those duplicates from being inserted in the first place. If you haven’t heard the phrase upsert before, it’s the idea of doing an INSERT and UPDATE in the same statement, that way…
Category: Microsoft SQL
De-duping by CTE
Ok, today we wrap up our design patterns for removing duplicates from tables by turning to common table expressions (CTE’s). When I start talking CTE’s some people begin to cringe in fear. I’ll admit, I still have to refer to a reference when doing something I haven’t done before with a CTE, which is pretty…
De-duping by Self join
Let’s continue the series on getting rid of duplicate data. Today we’re going to use self joins to get rid of them. A self join is a join from a table back to itself. It might seem weird to do that, but by the end of this article you’ll see how it’s useful. I use…
De-duping by temp table
Last time, we learned that manually removing duplicate rows from a table could be a long and manual process. Especially when you have as many duplicates as I created in our troubled table. Today we’re going to look at using temp tables to remove the extra rows. I gave you the overview to this already,…
Manually Deleting Stinks!
Go ahead and set up your TroubledTable by running the Create Troubled Table T-SQl Script. Now take a look at your duplicate rows. SELECT email, COUNT(*) FROM troubledTable GROUP BY email HAVING COUNT(*) > 1 You’ve got 200 rows that have been duplicated over and over. Not a pretty sight. Imagine how long it you…
Design Patterns Session
On February 6, I’m going to tele-present a session on T-SQL design patterns to the Brazil SQL Server Users Group. During this session I’ll cover three common scenarios you face, that can be solved by using T-SQL: Removing duplicate rows from tables. Create Troubled Table T-SQl Script — This contains the table we’ll use for…
How Much Logging Goes on for a Clustered Index Drop?
A week back @fatherjack put out a question on #sqlhelp that immediately drew my curiousity. He wanted to know how much his log file would grow if he dropped his clustered index (primary key), and then created a new one. I was thinking that it wouldn’t write a lot of information to the log file,…
Who Shrank Roger Rabbit
It all started Thursday afternoon, about 1500. I was walking back from Starbucks with my afternoon pick-me-up, when a co-worker mentioned we have this process or person who shrinks the log file, apparently every week. “Is there some way I can find out who, or what is shrinking the log files?” And with that, I…
88-970: My Study Guide
Since I know I’m going to go for the SQL MCM next year, I’m going to start collecting all of my study materials here, on my blog. Hopefully that way it can jump start your own study process towards that goal. I’ll mark all the entries with the tag 88-970, since I don’t really know…
70-450: Complete!
After another month of studying, I finally took the 70-450. I’m happy to say I passed! I did learn one thing from the exam: I should spend more time with encryption techniques and what all you can accomplish through encryption. Unfortunately, I’ve never had the business excuse to practice with encryption. I am interested in…