R Software Training Kenya | Tobit Research Consulting Nairobi
Statistical Software Training • Course 05

R Software Training in Kenya — Statistical Computing & Data Analysis

R is an open-source, extensible statistical computing language and environment available for Windows, Macintosh, UNIX, and Linux — completely free to download and use. It acts as a powerful alternative to traditional packages such as SPSS, SAS, and Stata, but with far greater flexibility, exceptional visualisation through ggplot2, and over 20,000 freely available packages. Our expert-led training takes you from your very first command in RStudio through data wrangling, visualisation, and advanced statistical modelling for research.

3 Modules 15 Hours Total 100% Free Software Online & Physical Ksh 5,000 / Module 1-on-1 Available

Course Details

📦
3 ModulesBeginner · Intermediate · Advanced
5 Hours / Module15 hours total programme
💻
Online or PhysicalBased on your preference
🤝
1-on-1 AvailableIndividual sessions on request
📅
Flexible TimingMon–Fri 9am–6pm · Sat 9am–5pm
Ksh 5,000
per module · per individual
🤷

R & RStudio are 100% free to download and use

3
Training Modules
15hrs
Total Duration
5,000+
Students Trained
🤷
Free Open Source
Overview

What is R & Why Should Researchers Learn It?

R is an open-source, extensible programming language and statistical computing environment available for free on Windows, Macintosh, UNIX, and Linux. It serves as a powerful alternative to SPSS, SAS, and Stata — offering far greater flexibility through its vast ecosystem of freely available packages covering every area of statistical analysis.

Unlike commercial software requiring expensive licences, R is completely free. This makes it an increasingly preferred tool for academic researchers and professionals in Kenya — particularly for research requiring advanced statistical techniques and high-quality data visualisation. R was created specifically to help researchers and analysts apply statistical methods to socio-economic, health, business, and scientific research problems.

Through packages such as ggplot2, dplyr, tidyr, and the Tidyverse, R provides powerful tools for data manipulation, exploratory analysis, regression modelling, and publication-quality graphics that far surpass what SPSS or Excel can produce.

R’s key advantage: With over 20,000 freely available packages on CRAN, R can address almost every statistical inference problem — from basic descriptive statistics to machine learning and advanced econometrics — all within a single, completely free environment that runs on any computer.
R vs Other Statistical Software
FeatureSPSSStataR ✓SAS
CostExpensiveAffordableFreeExpensive
VisualisationGoodVery GoodExcellentGood
PackagesLimitedGood20,000+Limited
Reproducible ReportsLimitedLimitedR MarkdownLimited
Machine LearningBasicBasicFull supportGood
Learning CurveGradualGradualModerateSteep
Key R Packages You Will Use
gg
ggplot2

Publication-quality visualisation

dp
dplyr

Fast data manipulation

td
tidyr

Reshaping and tidying data

rd
readr

Importing CSV and data files

tv
tidyverse

Core data science collection

rm
R Markdown

Reproducible research reports

Training Objectives

What You Will Achieve

By the end of this training, participants will have the practical R skills needed to perform rigorous statistical analysis for research, academic work, and professional data analysis.

01

Apply R software in practical analysis and research of socio-economic problems at school and in the workplace — using R to clean, transform, analyse, and visualise real research datasets effectively.

02

Apply R and complementary software in academic projects and thesis work — generating statistically rigorous results for Masters and PhD dissertations across social sciences, economics, public health, and business.

03

Use R’s Tidyverse ecosystem (dplyr, ggplot2, tidyr, readr) to efficiently import, clean, wrangle, and visualise data from real-world research datasets using best-practice reproducible workflows.

04

Conduct statistical inference and modelling in R — including descriptive statistics, correlation, regression, ANOVA, and hypothesis testing — with correct interpretation for thesis chapters.

How We Teach

Training Mode & Approach

🏫

Class-Based Training

Structured, expert-led sessions online via video call or in person at our Nairobi offices, with step-by-step instruction throughout.

💻

Practical R Use

Every session is hands-on in RStudio. You write real R code, run live analyses, and work directly on your own or provided research datasets.

📝

Take-Away Assignments

Each module includes structured practice assignments to reinforce learning and apply R techniques to your own research data.

🤝

One-on-One Sessions

Individual sessions available on request — ideal for personalised guidance on your specific research project or thesis dataset.

Your Journey

The Three-Module Learning Path

Each module builds progressively — from R and RStudio fundamentals through data wrangling and visualisation to statistical modelling and advanced analysis.

01
Beginner

Introduction to R & RStudio

R and RStudio setup, R syntax basics, data types, vectors, data frames, importing data, and foundational data manipulation commands.

02
Intermediate

Data Wrangling & Visualisation

dplyr for data manipulation, tidyr for reshaping, ggplot2 for publication-quality charts, and exploratory data analysis techniques.

03
Advanced

Statistical Analysis & Modelling

Descriptive statistics, hypothesis testing, correlation, regression, ANOVA, and R Markdown for reproducible research reports.

Course Content

Detailed Module Content

Every topic is covered hands-on in RStudio using real datasets. Each module is Ksh 5,000 per individual and runs for 5 hours.

Mod1

Introduction to R & RStudio Basics

R environment · Syntax · Data types · Vectors · Data frames · Data import & export

Ksh 5,000
5 hours

Introduction to R & RStudio

What is R and how it differs from SPSS, Stata, and SAS
Installing R and RStudio — both free for Windows, Mac, and Linux
The RStudio interface — Console, Script Editor, Environment, and Plots panels
Writing and running R scripts
Setting the working directory
Installing and loading packages — install.packages() and library()
Getting help in R — help(), ?, and documentation

R Syntax & Data Types

Basic R data types
Numeric, integer, character, logical
Factors and ordered factors for categorical variables
Creating and manipulating vectors
Arithmetic and logical operators
Matrices — creating and indexing
Lists — creating and accessing elements
Writing basic functions in R

Data Frames — The Research Dataset in R

Creating data frames from scratch
Indexing and slicing data frames
Adding, renaming, and removing columns
Filtering rows with conditions
Handling missing values (NA) in data frames

Importing & Exporting Data

Importing CSV files with read.csv() and readr
Importing Excel files with readxl
Importing SPSS .sav files with the haven package
Exploring data — head(), str(), summary(), glimpse()
Exporting cleaned data to CSV and Excel
Mod2

Data Wrangling & Visualisation with Tidyverse

dplyr · tidyr · ggplot2 · Data cleaning · Reshaping · Exploratory analysis

Ksh 5,000
5 hours

Data Manipulation with dplyr

Introduction to the Tidyverse ecosystem
The pipe operator (%>%) — chaining commands cleanly
Core dplyr verbs
filter() — subsetting rows by conditions
select() — choosing specific columns
mutate() — creating new variables
arrange() — sorting data ascending and descending
summarise() — aggregating data
group_by() — grouped summaries by category
Joining data frames — left_join, inner_join, full_join
Recoding variables with case_when()

Data Reshaping with tidyr

Wide vs long data format — when and why
pivot_longer() — converting wide to long
pivot_wider() — converting long to wide
separate() and unite() — splitting and combining columns
Handling missing data — drop_na(), replace_na(), fill()

Data Visualisation with ggplot2

The grammar of graphics — layers, aesthetics (aes), and geoms
Core chart types
Scatter plots — geom_point()
Line graphs — geom_line()
Bar charts — geom_bar() and geom_col()
Histograms — geom_histogram()
Box plots — geom_boxplot()
Density plots — geom_density()
Faceting — facet_wrap() and facet_grid() for subgroup plots
Customising themes, colours, labels, and legends
Saving publication-quality plots — ggsave() at 300 dpi for thesis
Mod3

Statistical Analysis & Modelling in R

Descriptive stats · Hypothesis testing · Correlation · Regression · ANOVA · R Markdown

Ksh 5,000
5 hours

Descriptive Statistics

Measures of central tendency — mean, median, mode
Measures of dispersion — standard deviation, variance, range, IQR
Frequency tables and cross-tabulations
Skewness and kurtosis — the psych package
Checking normality — Shapiro-Wilk test, QQ plots, histograms
Descriptive statistics by subgroup using dplyr and summarise()

Hypothesis Testing

Formulating null and alternative hypotheses in R
p-values, confidence intervals, and significance levels
One-sample and two-sample t-tests
Independent samples vs paired t-tests
Chi-square test for categorical variables
Non-parametric tests — Mann-Whitney U, Wilcoxon

Correlation & Regression Analysis

Pearson and Spearman correlation
Correlation matrix visualisation — corrplot package
Simple linear regression — lm() and output interpretation
Multiple linear regression — coefficients, R², F-test, p-values
Regression diagnostics — residuals, normality, linearity, homoscedasticity
Logistic regression for binary outcomes — glm()

ANOVA & Reproducible Reports

One-way ANOVA — aov() and result interpretation
Post-hoc tests — TukeyHSD for pairwise comparisons
Two-way ANOVA with interaction effects
Introduction to R Markdown for reproducible research
Generating Word, PDF, and HTML reports from R Markdown
Publication-ready tables with knitr and flextable
In Practice

Common R Commands You Will Learn

Our training is completely hands-on. You write and run real R code in RStudio from day one — directly applicable to your own research datasets.

Data Import & Exploration
# Load the Tidyverse library(tidyverse) library(readxl) library(haven) # Import from CSV df <- read_csv("mydata.csv") # Import from Excel df <- read_excel("mydata.xlsx") # Import from SPSS .sav df <- read_sav("mydata.sav") # Explore the dataset glimpse(df) summary(df) head(df, 10)
Data Wrangling with dplyr
# Filter, select, mutate with pipe df_clean <- df %>% filter(age > 18) %>% select(id, age, income, education) %>% mutate( log_income = log(income), edu_group = case_when( education < 12 ~ "Primary", education < 16 ~ "Secondary", TRUE ~ "Tertiary" ) ) # Grouped summary statistics df %>% group_by(edu_group) %>% summarise( mean_inc = mean(income, na.rm = TRUE), n = n() )
Visualisation with ggplot2
# Scatter plot with regression line ggplot(df, aes(x = age, y = income)) + geom_point(color = "steelblue", alpha = 0.6) + geom_smooth(method = "lm", se = TRUE) + labs( title = "Age vs Income", x = "Age (years)", y = "Monthly Income (Ksh)" ) + theme_minimal() # Save for thesis (300 dpi) ggsave("plot.png", dpi = 300)
Statistical Tests & Regression
# Descriptive statistics psych::describe(df$income) # Shapiro-Wilk normality test shapiro.test(df$income) # Pearson correlation cor.test(df$age, df$income, method = "pearson") # Multiple linear regression model <- lm(income ~ age + education + gender, data = df) summary(model) # One-way ANOVA fit <- aov(income ~ edu_group, data = df) TukeyHSD(fit)
Why R

Key R Capabilities You Will Master

R’s combination of statistical power, free availability, and exceptional visualisation makes it one of the most valuable tools a researcher in Kenya can learn.

🤷

Completely Free & Open Source

R and RStudio are both 100% free — no licence fees, no annual renewals needed. Install on any computer and use for life at zero cost.

📊

Excellent Data Visualisation

ggplot2 produces publication-quality, highly customisable charts far superior to SPSS or Excel — perfect for theses and journal submissions.

🔧

Powerful Data Wrangling

The dplyr and tidyr packages make data cleaning, filtering, reshaping, and transformation intuitive and fast — even for complex, messy datasets.

📈

Comprehensive Statistical Analysis

From basic descriptive statistics to advanced regression, ANOVA, and multivariate analysis — R covers every technique required for postgraduate research.

📄

Reproducible Research with R Markdown

Combine code, results, and narrative in one document — generating Word, PDF, or HTML reports automatically. Essential for transparent thesis writing.

📦

20,000+ Free Packages

CRAN hosts over 20,000 R packages covering every analytical need — from basic statistics to machine learning and bioinformatics — all completely free.

Is This For You?

Who Should Attend R Software Training?

R is valuable for researchers who want advanced visualisation, greater analytical flexibility, and a tool that will never cost them a licence fee.

🎓

Masters & PhD Students

Need to perform data analysis for your dissertation but want a more flexible, free alternative to SPSS? R fully covers everything required for Chapters 4 and 5 — regression, ANOVA, hypothesis testing, and publication-quality charts.

🌎

Social Science & Development Researchers

R was specifically designed for socio-economic and social science data analysis. Whether analysing household surveys, poverty data, or community health datasets — R handles it all.

📊

Economists & Finance Professionals

Need advanced econometric techniques beyond SPSS? R’s packages for panel data, time-series, and regression modelling give you complete analytical freedom.

🏥

Public Health Researchers

R is widely used for epidemiological analysis, survival analysis, and logistic regression — ideal for public health, clinical research, and health policy dissertations.

🔄

SPSS Users Looking for More Flexibility

Already know SPSS but feeling limited? R offers everything SPSS does — and much more — for free. We cover transitioning your existing SPSS .sav files into R using the haven package from day one.

💼

Data Analysts & Professionals

Need to add R to your professional skill set? Our training equips you with job-ready data wrangling, visualisation, and reporting skills using Tidyverse and R Markdown.

Questions Answered

Frequently Asked Questions

Everything you need to know before enrolling in our R software training programme.

Do I need any programming experience to learn R?
No programming experience is required. Module 1 starts completely from scratch — we cover how to install R and RStudio, understand the interface, and write your very first R commands step by step. The training is designed for researchers and students, not programmers. Many participants have no coding background at all and successfully complete all three modules.
How much does R cost to download and use?
R and RStudio are both completely free to download and use — there are no licence fees, annual renewals, or student discount programmes needed. You simply download R from CRAN and RStudio Desktop from the Posit website, both at no cost. This makes R one of the most financially accessible statistical tools for Kenyan researchers.
Is R accepted in Kenyan university dissertations?
Yes. R is accepted and increasingly encouraged across Kenyan universities for postgraduate data analysis — particularly in economics, social sciences, public health, environmental science, and business research. Its use is growing rapidly as an alternative to SPSS, especially at PhD level where more advanced analytical techniques are required.
Can I take just one module — for example, only Module 2 for ggplot2 and dplyr?
Yes. Each module is independently bookable at Ksh 5,000. If you already know R basics and want to focus on data wrangling with dplyr and visualisation with ggplot2, you can book Module 2 alone. We recommend a working knowledge of R basics before joining Module 2 or 3 independently.
How does R compare to SPSS for research analysis?
SPSS is a point-and-click interface suited to standard statistical tests. R is a full programming language — which means a slightly steeper initial learning curve, but far greater flexibility, power, and zero cost. R’s ggplot2 produces significantly better visualisations than SPSS, it handles more advanced models, and it never costs a licence fee. Many researchers use both — SPSS for quick analysis and R for visualisation and advanced modelling.
Will the training use my own research data?
Absolutely. Bringing your own research dataset makes every exercise directly applicable to your thesis. If you are still collecting data, we will use representative example datasets covering socio-economic, health, and education research contexts.
Is R Markdown covered in the training?
Yes. R Markdown is covered in Module 3. You will learn how to create reproducible research reports combining your R code, analysis results, and written narrative — generating Word, PDF, or HTML documents automatically. This is an extremely valuable skill for thesis writing and increasingly required for journal publication.
Ready to Begin?

Book Your R Software Training Session

Fill in our booking form or contact us directly by phone or WhatsApp. Our team is available Monday to Saturday to confirm your slot and answer any questions.

Get In Touch

Have Questions? Contact Us Directly

We are happy to help you decide which module to start with or discuss how R fits your specific research needs.

Contact Us. We are ready to help you!

Let's have a chat