12/26/2017
by Marlon Ribunal
2 Comments

My Learning Goals for 2018

A couple of weeks ago, the SQL Server community had their last T-SQL Tuesday of the year – 97th since 2009 – hosted by Mala (b|t). T-SQL Tuesday, a brainchild of Adam Machanic (b|t), is a monthly blog party in which participant tech bloggers write about any given topic. The topic for December is about learning goals for 2018.

I guess this post is quite late to the party but it’s good to have a baseline for what I want to learn for the new year, and so, this post is quite appropriate. Mala has given three questions that serve as guidelines for the format of the post:

  • What do you want to learn?
  • How and when do you want to learn?
  • How do you plan to improve on what you learned?

And, I’m going to use those questions to structure my learning goals.

What Do I Want to Learn?

Quick answer: Many things. I am declaring 2018 as my Career Year. So, everything I do is anchored to that theme. For 2016-2017, my mantra was Kaizen – Continuous Improvement. I want that to still be a part of what I do in 2018 – but bolder. My mantra for 2018, therefore, is Daitan’na (大胆な ): audacious, daring, bold. Daring with a hint of Kaizen, if you will.

I want to learn the following:

  • Big Data
  • Power BI
  • R Language

Yes, bold. Big Data itself is already a, well, big elephant. One bite at a time would not do it. Big Data is too big a topic to only learn it in a span of a year. But I am not shooting for mastery yet. The goal is to gain the practical skills that would translate into elements of a work experience – meaning, that I would survive if I was suddenly thrown into a Big Data dungeon.

Power BI is a natural next-step if you’ve worked with SSRS. There was a time that Tableau is a craze. Power BI can capture that market. I want to equip myself with the necessary skills that would measure proficiency. Therefore I need to build upon my Power BI skills.

T-SQL alone would not do it anymore. I need to learn a more powerful language for analysis. The R Language has been an interest to me ever since and I want to dig deeper into it.

When and How Do I Learn?

I work a full-time job. For many people who work in the tech industry, they know that a full-time job doesn’t just mean 9-5. It could mean 12 or more hours a day. You’ll never have a 40-hour week. Time is unpredictable in the tech industry. So, When, is a question that is hard to answer.

Learning itself is a philosophy that I have long embraced. I will find the time to learn even if it would mean sacrificing some of my sleeping time. Yes, that’s bold. Sleep is a commodity that even the Bourgeoisie can’t afford anymore. I’ll set a minimum of 1 hour for learning every day (this is on top of my minimum of 30 minutes of reading miscellaneous topics). That’s the minimum, meaning even if I don’t have time to study, I must find at least an hour for that purpose.

How do I learn is the easier question. I have three major places of learning: Pluralsight (courtesy of the Friends of Redgate Program), UDemy, and Microsoft Professional Program + edX. Of course, blogs, webcast, and even tech videos on YouTube and other media will help. In-Person events like the SQL Saturday are a great resource for the things that I want to learn as well but I am not sure if I have the time this year for such activity. The focus is to learn. It’s not a question of how or where do I learn but of what do I learn. And, there is an abundance of materials out there that I can use to my advantage.

How Can I Improve on What I Learned?

Learning for the sake of learning is an empty goal. The purpose of my learning goals is to scale my tech skills. It wouldn’t make sense to spend all of your precious time for nothing. Obviously, these goals are set on the personal level – learning is an aspect of my career that I have committed to. But of course, that doesn’t end there. I want to maintain certain baseline or level of skills that are readily applicable in the real world. I may not land a Big Data job or position in my current company or outside right away but I want to maintain a working-level experience at my disposal.

One avenue that I can probably utilize for sharing what I’ve learned is my blog. Well, tech blogging is one of the aspects that I want to improve on this year. I believe that my blog is an indispensable part of my career. So, this year, I also want to set aside some time for writing. And, of course, participating in forums should also be on the list.

Like I said, these learning goals are rather bold. But, if I want to focus on my career this year, it is just fitting that I have a bold mentality.

Daitan’na. Kaizen. Daring with a hint of Kaizen.

10/03/2017
by Marlon Ribunal
1 Comment

Book Review: High Performance SQL Server

As SQL Server professionals, we sometimes encounter some issues that we haven’t seen before or have seen already but forgot how we fixed them. There are few things that we usually do in these situations: Diagnose using sp_whoisactive or sp_Blitz, or any other tools that we have at our disposal, try to remember what we did to troubleshoot the issues, fire up our Wait Stats queries, check DMVs or maybe Extended Events, etc.

If none of these work, then we do what normal data pros usually do in desperate times: Google stuff or maybe post a question or two on Twitter using the hashtag #sqlhelp. But the problem is that the information that we need is all over the interwebs. There is just too much information but little time to look at each one of them.

If you are like me, you still find technical books to be relevant these days. And I mean the real, physical books. Great technical books offer high-value content that is readily available in one place (that is if you picked the right book for your purpose). Plus, if you’re looking to unplug, reading books is a good way to pry yourself away from the screen. For example, High-Performance SQL Server: The Go Faster Book by Benjamin Nevarez (B|T). I recently added this book to my SQL Server shelf.

Some of you know that I started a new job this year with a software company that caters to the retail industry (our clients are big brand retailers). Part of my day-to-day job is troubleshooting performance issues on SQL Server. So aside from reading technical blogs, I read technical books to improve my skills.

Let’s go back to Ben’s book. With only 200 pages, this book is not overwhelming to read. Yes, it’s short and it doesn’t have all the things about SQL Server but, I think, it has all the important things that I need to know. And, yes, it covers up to SQL Server 2016. It has 9 Chapters:

High Performance SQL Server

  1. How SQL Server Works
  2. Analyzing Wait Statistics
  3. The Query Store
  4. SQL Server Configuration
  5. TempDB Troubleshooting and Configuration
  6. SQL Server In-Memory Technologies
  7. Performance Troubleshooting
  8. Indexing
  9. SQL Server Storage

The chapters that I found important (of course, this is relevant to who is reading the book) are 2, 5, 7, and 8.

Chapter 2 deals with Wait Stats. We love Wait Stats. If you are like me, you probably have SQL Server Wait Statistics: Tell Me Where It Hurts query from Paul Randal (B|T) in your arsenal. I haven’t perfected the Wait Statistics methodology of performance tuning yet and this is why I like this chapter. This chapter “explains how the task execution process fits into waits and queues performance methodology”. I know what processes are waiting but why they are waiting is the most important question to answer. This chapter discusses waiter list, runnable queues, task execution process, Extended Events, Latches and Spinlocks, Blocking and many other things about analyzing wait statistics.

Chapter 5 is about TempDB. One of the common issues that I encounter in my job is issues with TempDB running out of space although there is enough physical disk space for the TempDB files. When an application is doing tons of concurrent tasks that depend heavily on TempDB, you may want to be proactive about the health of you TempDB. Well, according to Brent Ozar (B |T), TempDB is the public toilet of SQL Server, so we better keep it clean. The information from this chapter will help you maintain the TempDB and keep it sanitary. Topics discussed are fixing latch contention, data file pages, Trace Flag 1117 & 1118, tempdb events, TempDB spills, monitoring TempDB disk space, etc.

Chapter 7 covers the overall performance troubleshooting. The topics discussed are performance counters, comparing batches and transactions, log growth, other DMVs not discussed in previous chapters, SQL Trace & Extended Events, Data Collector, what’s new in SQL 2016, etc.

Chapter 8 is about Indexes. This chapter explains how SQL Server uses indexes. It also offers tips on where to use indexes, how to work with indexes and how to use the execution plans, missing indexes feature, Database Engine Tuning Advisor & Index Tuning Wizard, etc. One does not simply over-study indexes.

Ben Nevarez is one of the people I respect and look up to in the SQL Server Community because of what they do in helping other SQL Server professionals grow in their career and personal life. But this is not the only reason why I like this book. The topic selections and how they were delivered in a minimal number of pages ensure that I get the information that I most need to know about performance tuning and not get overwhelmed learning about them.

This book is a good complementary material to Pro SQL Server Internals: Understand What Happens Under The Hood and How it Affects You by Dmitri Korotkevitch (B|T) from the same publisher as Ben’s, which I also have.

02/16/2016
by Marlon Ribunal
4 Comments

Power BI KPI Report: Hospital Charge Data

Update: Just getting started with Power BI? You can download the Power BI Report File for this KPI Report. Please find the link at the end of this post.

T-SQL Tuesday Blog

The Power BI KPI report is probably on top 10 of the most in-demand use-case for Power BI.  In my previous post, I published the report I created with Power BI Desktop based on the Medicare payment data that I downloaded from Data.gov. That report was inspired by the New York Time’s “How Much Hospitals Charged Medicare for the Same Types of Cases” visual report. What I did differently on my report is slice the data a little bit further down to the provider level. If you check both reports, NYT’s and mine, you’ll see what I’m talking about.

I came across  Jorge Segarra‘s (@sqlchicken) T-SQL Tuesday challenge to publish a Power BI report on the web for T-SQL Tuesday. So I went back to my Medicare Payment Power BI report and think of few ways to make it visually appealing and more “useful”.

If you’ve never tried Power BI before, I have written some getting-started posts:

Getting Started With Power BI Desktop: Installation

Getting Started With Power BI Desktop: Getting Data

Getting Started With Power BI Desktop: Simple Visualization

tl;dr

If you want to skip the whole how-to below, go directly to the Power BI KPI report by clicking the picture below.

Hospital Charges KPI Report in Power BI

Guide to reading the Hospital Charge KPI report

Here’s a simple illustration of how to read this Power BI KPI report. I hope it’s intuitive enough.

Power BI Medicare Hospital Charges KPI Report

Power BI KPI: About the data

I downloaded the data that I used for this Power BI KPI report from Data.gov. I recommend you visit this site if you’re looking for data to play with for Power BI. The major columns that I focused on for this report are the Provider Name (Hospital), Average Covered Charges, Average Total Payment, and Average Medicare Payment. The amount of dollars provided in the data source and in this Power BI KPI report are not actual amount on per-case basis but the actual average of all those cases grouped together.

For example, the provider Southeast Alabama Medical Center have the following average amounts for the DRG case  Extracranial Procedures w/o CC/MCC:

No of cases: 91

Average Covered Charge: $32,963.07

Average Total Payment:  $5,777.24

Average Medicare Payment:  $4,763.73

Just a little overview: “The data provided here include hospital-specific charges for the more than 3,000 U.S. hospitals that receive Medicare Inpatient Prospective Payment System (IPPS) payments for the top 100 most frequently billed discharges, paid under Medicare based on a rate per discharge using the Medicare Severity Diagnosis Related Group (MS-DRG) for Fiscal Year (FY) 2011. These DRGs represent more than 7 million discharges or 60 percent of total Medicare IPPS discharges.”

Slicers

In this Power BI KPI report, the slicers, or filters, in this case, are Provider State, Provider City, and Provider Name. All the slicers are set up as illustrated (Horizontal orientation and Single Selection):

Power BI Slicers

Slicer Interactions

The Slicer Interactions is a handy property that you can set to enable the slicer to cascade into another slicer. It also interacts as filters to other elements within the report.

If you don’t see Funnel and Prohibition/No Sign icons on the visualization, select the Edit Interaction item in the Power BI ribbon:

Edit Interactions Power BI

Interaction_ON = Interaction between two (2) visualization or more components is ON.

Interaction_OFF = Interaction between two (2) visualization or more components is OFF.

The table below indicates (shaded/grayed) which visualization components interact with which component:

Power BI Interaction Table

In the example below, the Provider Name slicer interacts with, or in this case, filters, the actual values and KPI’s but not with the Minimum values, Maximum values, and Average values cards.

Power BI KPI Report Interaction

New Table for Averages

For the DRG averages, I created a separate table using the SUMMARIZE() function; and created a relationship between that and the main table using the DRG Definition as the relationship key.

[sql]
AveragesPerCase =
SUMMARIZE(
HospitalCharges, HospitalCharges[ShortDRGDefinition],
"AvgCharges", AVERAGE(HospitalCharges[ Covered Charges ]),
"AvgPayments", AVERAGE(HospitalCharges[Payments]),
"AvgMedicarePayments", AVERAGE(HospitalCharges[Medicare Payments])
)
[/sql]

Setting Relationship in Power BI

String Function to remove Prefix

The DRG Definition has prefix numeric code [e.g., “nnn - “]. I created a new Column without this prefix using the RIGHT() and LEN() functions:

[sql]
ShortDRGDefinition =
RIGHT(
[DRG Definition],
LEN([DRG Definition])-6
)
[/sql]

Before039 - EXTRACRANIAL PROCEDURES W/O CC/MCC

After: EXTRACRANIAL PROCEDURES W/O CC/MCC

Minimum and Maximum Values

For this Power BI KPI report, I created new measures for Min/Max values for Covered Charges, Payments, and Medicare Payments using the MIN() and MAX() functions:

Minimum Values

[sql]
Min Covered Charges =
MIN([ Covered Charges ])
[/sql]

Maximum Values

[sql]
Max Covered Charges =
MAX([ Covered Charges ])
[/sql]

Multi-Row Cards

For the visualization of these min/max values, I used Multi-row card visualization component containing the Provider (Hospital) that charges that min/max amount and the min/max amount:

multi-row card power bi

To find the matching provider to the min/max value, I used the LOOKUP() function:

[sql]

Min Charge Provider =
IFERROR(LOOKUPVALUE(HospitalCharges[Provider Name],
HospitalCharges[ Covered Charges ],
MIN([ Covered Charges ])),
"*Multiple hospitals")
[/sql]

MIN() function returning multiple matchThe IFERROR() function is there to handle multiple values. What I found out is that there are some min/max criteria that return multiple hospitals. For example, hospital A and B has the same dollar amount for x DRG which happens to be the minimum/maximum dollar amount in that category. So, if two (2) or more hospitals share the min, or max, dollar amount, the field will return the literal “*Multiple hospitals.”

Custom Visualization: KPI

Power BI Desktop, as of this writing, does not have a native or built-in KPI visual component. Thanks to Power BI Visuals Gallery. As adaption to Power BI grows, I’m sure we can have more and more of these visual components added to the gallery.

To add a third-party visual component to your Power BI report, Download the visual from the gallery. On the Power BI report designer, click on the Ellipsis on the Visualizations pane to add a visual component file:

Adding a Visualization Component in Power BI

Since the component is from a third-party, Power BI won’t initially allow you to use it. It will notify you that you are taking a risk in installing it.

Aside from the notification bar that appears on the report designer, the visualization will appear as follows:

third-party visualization warning on Power BI

To enable the visualization, click on the enable button on the notification bar:

Enable Power BI Custom Visulization

Key Performance Indicator in Power BI

The custom visualization that I used is called KPI Status with History (downloaded from the Power BI Visual Gallery). I don’t particularly need the “history” portion of the component for this particular KPI. I just need to see the deviation of the “actual value” from the “target value” which is the average value.

The threshold is set as follows (built-in in the visual component):

LOW (GREEN) – Less than Zero Amount compared to the Target Value

MEDIUM – OK (YELLOW) – 3% Band

HIGH (RED) – More than the 3% banding

My baseline for the KPI is the “average amount” (the “target value”) for that particular DRG with a 3% band, which means anything that goes beyond 3% of the “target value” is considered HIGH.

KPI Baseline Band Threshold

The percentage represented is the percentage of the increase/decrease based on the average amount. Think of it as a “markup” percentage, which is calculated as:
Markup Percentage Equation

Where:

Target Value = Average Amount

Actual Value = Actual Value of the selected provider

In the illustration above, the Covered Charges amount of $24,549.18 is 44.1% more than the average Covered Charges, which is $17,732.65 (HIGH – RED). The Payment amount of $5,069.63 is within the 3% band, which is 1.7% more than the Average Payment amount of $4,983.95 (OK – YELLOW). The Medicare Payment of $3,453.45 is 11.9% lesser than the Average Medicare Amount of $3,919.91 (LOW – GREEN).

Visualization Formatting in Power BI

I set all the Multi-row card with the same width and height. I used a rectangle shape to group related visualizations together (min values, max values, average values, and the current values and their KPI). The visualization properties are the easiest way to align components in your report.

setting visualization properties in Power BI

Adding a background to the Power BI report

Here’s how to add a background photo to your report.

Click an empty space in your report, so that the report itself is selected (and not any of the components). Expand the Page Background from the Format Menu in the Visualization pane and click the Add Image button:

Adding Background Photo for your Power BI Report

After the image has been imported and embedded to the report, select the Transparency and the Image Fit that you want.

Power BI Publication to Web

The latest feature in Power BI is the ability of the Power BI developer to give access to anonymous report viewers or public access.

From the Power BI Desktop, save and Publish the report to Power BI Web:

Go to File > Publish > Publish To Web

Publish Power BI Report

Then login to your Power BI web Account.

Open/select the Report that you want to publish to web. Go to File >Publish To Web

Power BI Publish To Web Feature

Power BI will give you a link that you can share via email or social media sites like Twitter, LinkedIn, and Facebook. It also provides you a code that you can use to embed the report in your blog.

Power BI Publish To Web Code

For example, see my previous blog for the embedded Power BI Report.

Try it for yourself

Here you can download the Power BI Report file: HospitalChargeKPI.pbix