SAS Day 41 Background Story: Most of the time we use the binomial option of Proc Freq to generate a confidence interval for a 2-level categorical variable, such as Success or Failure. We also use the confidence interval (Lower Limit, Upper Limit) to represent the proportion estimated in the dataset. Once, instead of directly applying the Upper Limit and Lower Limit, my boss asked me…

# Category: SAS

## Forest Plot for Overall Survival Confidence Interval

SAS Day 40: Forest Plot. Tired of looking across many overall response tables while comparing the same rate? The forest plot is a perfect solution for summarizing data from multiple sources and converting it into a single reader-friendly graph. Ideal Forest Plot: Takes all the relevant studies that ask the same question with the same statistical model, and identifies a common…

## Proc SQL Overview

SAS Day 39: When I was in Bogota, Colombia, Martha showed me how her son used a mind map to create a holiday schedule. I really loved it. As I am summarizing the applications of SQL, I decided to use a mind map for an overview, and I will provide a detailed example for each category in the future. SAS Proc SQL Overview: …

## Proc SQL Full Join

SAS Day 38: The SAS Merge statement is a good data step tool for joining a couple of datasets together. Nonetheless, a SQL join is a fancier one: not necessarily easier to use, but definitely more sophisticated! Today we will show a SQL Full Join with 3 datasets: a, b, and c. Full join for Two Datasets: proc sql noprint; create table all as select…

## Proc Freq

SAS Day 37: In previous days, we discussed generating a statistical summary using Proc Means and Proc Univariate for Continuous Variables (Age, BMI, Height, Weight). Now the question is: what procedure should we apply to summarize Categorical Variables (Sex, Race, Country)? Today we will show how to use Proc Freq to summarize the Categorical Variables. Proc Freq: Sample dataset: Subjid ID:…

## Proc Univariate

SAS Day 36: In SAS Day 27, we showed how to use Proc Means to generate statistical summaries for Continuous Variables such as Age, BMI, Height, and Weight. As the old saying goes, “All roads lead to Rome”. Today we will introduce Proc Univariate as another way to create the Summary Statistics. Task: Proc Univariate Sample for Statistical Summaries Sample Dataset…

## SAS Retain 1

SAS Day 35: Once, I was supposed to generate an Overall Response dataset including Label (lbl), patients with escalation (esc_y), patients without escalation (esc_n), and total patients (all). Most of the time, the following dataset is sufficient for the purpose. However, the request was special, so we have a tiny challenge. Challenge: we need the dataset to be in exact order: label, escalation…

## SAS Iris dataset Plot

SAS Day 34: Background Story: Once, in my machine learning class, the professor asked what software we use for data science. One student answered: “SAS”. Then the professor laughed and said: “Oh dear, you must be in the wrong class, nobody uses SAS in the data science industry”.

## SAS Boxplot

SAS Day 33: Box Plot Definition: A Box Plot, or box-and-whisker plot, displays the distribution of a dataset through its 5-number summary: minimum, q1, median, q3, and maximum. Interpreting quartiles: The 5-number summary divides the data into 4 sections, each containing approximately 25% of the data. Explore a little more: If we want to look at the Outliers, we define the…

## SAS Loess Plot

SAS Day 32: Loess Model Scatter Plot. When I was in Middle School, Sin(x) and Cos(x) were my favorite curves, because they are so predictable! Once we know the period and amplitude, we can solve everything about them. However, in statistical modeling, oscillating curves are not so welcome. Today we will introduce the Loess Plot for finding a curve of…