R Software Training Kenya | Tobit Research Consulting Nairobi

Statistical Software Training • Course 05

R Software Training in Kenya — Statistical Computing & Data Analysis

R is an open-source, extensible statistical computing language and environment available for Windows, Macintosh, UNIX, and Linux — completely free to download and use. It acts as a powerful alternative to traditional packages such as SPSS, SAS, and Stata, but with far greater flexibility, exceptional visualisation through ggplot2, and over 20,000 freely available packages. Our expert-led training takes you from your very first command in RStudio through data wrangling, visualisation, and advanced statistical modelling for research.

3 Modules 15 Hours Total 100% Free Software Online & Physical Ksh 5,000 / Module 1-on-1 Available

View Course Modules Book This Course

Course Details

📦

3 ModulesBeginner · Intermediate · Advanced

⏱

5 Hours / Module15 hours total programme

💻

Online or PhysicalBased on your preference

🤝

1-on-1 AvailableIndividual sessions on request

📅

Flexible TimingMon–Fri 9am–6pm · Sat 9am–5pm

Ksh 5,000

per module · per individual

🤷

R & RStudio are 100% free to download and use

Training Modules

15hrs

Total Duration

5,000+

Students Trained

🤷

Free Open Source

Overview

What is R & Why Should Researchers Learn It?

R is an open-source, extensible programming language and statistical computing environment available for free on Windows, Macintosh, UNIX, and Linux. It serves as a powerful alternative to SPSS, SAS, and Stata — offering far greater flexibility through its vast ecosystem of freely available packages covering every area of statistical analysis.

Unlike commercial software requiring expensive licences, R is completely free. This makes it an increasingly preferred tool for academic researchers and professionals in Kenya — particularly for research requiring advanced statistical techniques and high-quality data visualisation. R was created specifically to help researchers and analysts apply statistical methods to socio-economic, health, business, and scientific research problems.

Through packages such as ggplot2, dplyr, tidyr, and the Tidyverse, R provides powerful tools for data manipulation, exploratory analysis, regression modelling, and publication-quality graphics that far surpass what SPSS or Excel can produce.

R’s key advantage: With over 20,000 freely available packages on CRAN, R can address almost every statistical inference problem — from basic descriptive statistics to machine learning and advanced econometrics — all within a single, completely free environment that runs on any computer.

R vs Other Statistical Software

Feature	SPSS	Stata	R ✓	SAS
Cost	Expensive	Affordable	Free	Expensive
Visualisation	Good	Very Good	Excellent	Good
Packages	Limited	Good	20,000+	Limited
Reproducible Reports	Limited	Limited	R Markdown	Limited
Machine Learning	Basic	Basic	Full support	Good
Learning Curve	Gradual	Gradual	Moderate	Steep

Key R Packages You Will Use

ggplot2

Publication-quality visualisation

dplyr

Fast data manipulation

tidyr

Reshaping and tidying data

readr

Importing CSV and data files

tidyverse

Core data science collection

R Markdown

Reproducible research reports

Training Objectives

What You Will Achieve

By the end of this training, participants will have the practical R skills needed to perform rigorous statistical analysis for research, academic work, and professional data analysis.

Apply R software in practical analysis and research of socio-economic problems at school and in the workplace — using R to clean, transform, analyse, and visualise real research datasets effectively.

Apply R and complementary software in academic projects and thesis work — generating statistically rigorous results for Masters and PhD dissertations across social sciences, economics, public health, and business.

Use R’s Tidyverse ecosystem (dplyr, ggplot2, tidyr, readr) to efficiently import, clean, wrangle, and visualise data from real-world research datasets using best-practice reproducible workflows.

Conduct statistical inference and modelling in R — including descriptive statistics, correlation, regression, ANOVA, and hypothesis testing — with correct interpretation for thesis chapters.

How We Teach

Training Mode & Approach

🏫

Class-Based Training

Structured, expert-led sessions online via video call or in person at our Nairobi offices, with step-by-step instruction throughout.

💻

Practical R Use

Every session is hands-on in RStudio. You write real R code, run live analyses, and work directly on your own or provided research datasets.

📝

Take-Away Assignments

Each module includes structured practice assignments to reinforce learning and apply R techniques to your own research data.

🤝

One-on-One Sessions

Individual sessions available on request — ideal for personalised guidance on your specific research project or thesis dataset.

Your Journey

The Three-Module Learning Path

Each module builds progressively — from R and RStudio fundamentals through data wrangling and visualisation to statistical modelling and advanced analysis.

Beginner

Introduction to R & RStudio

R and RStudio setup, R syntax basics, data types, vectors, data frames, importing data, and foundational data manipulation commands.

Intermediate

Data Wrangling & Visualisation

dplyr for data manipulation, tidyr for reshaping, ggplot2 for publication-quality charts, and exploratory data analysis techniques.

Advanced

Statistical Analysis & Modelling

Descriptive statistics, hypothesis testing, correlation, regression, ANOVA, and R Markdown for reproducible research reports.

Course Content

Detailed Module Content

Every topic is covered hands-on in RStudio using real datasets. Each module is Ksh 5,000 per individual and runs for 5 hours.

Mod1

Introduction to R & RStudio Basics

R environment · Syntax · Data types · Vectors · Data frames · Data import & export

Ksh 5,000

5 hours

Introduction to R & RStudio

What is R and how it differs from SPSS, Stata, and SAS

Installing R and RStudio — both free for Windows, Mac, and Linux

The RStudio interface — Console, Script Editor, Environment, and Plots panels

Writing and running R scripts

Setting the working directory

Installing and loading packages — install.packages() and library()

Getting help in R — help(), ?, and documentation

R Syntax & Data Types

Basic R data types

Numeric, integer, character, logical

Factors and ordered factors for categorical variables

Creating and manipulating vectors

Arithmetic and logical operators

Matrices — creating and indexing

Lists — creating and accessing elements

Writing basic functions in R

Data Frames — The Research Dataset in R

Creating data frames from scratch

Indexing and slicing data frames

Adding, renaming, and removing columns

Filtering rows with conditions

Handling missing values (NA) in data frames

Importing & Exporting Data

Importing CSV files with read.csv() and readr

Importing Excel files with readxl

Importing SPSS .sav files with the haven package

Exploring data — head(), str(), summary(), glimpse()

Exporting cleaned data to CSV and Excel

Mod2

Data Wrangling & Visualisation with Tidyverse

dplyr · tidyr · ggplot2 · Data cleaning · Reshaping · Exploratory analysis

Ksh 5,000

5 hours

Data Manipulation with dplyr

Introduction to the Tidyverse ecosystem

The pipe operator (%>%) — chaining commands cleanly

Core dplyr verbs

filter() — subsetting rows by conditions

select() — choosing specific columns

mutate() — creating new variables

arrange() — sorting data ascending and descending

summarise() — aggregating data

group_by() — grouped summaries by category

Joining data frames — left_join, inner_join, full_join

Recoding variables with case_when()

Data Reshaping with tidyr

Wide vs long data format — when and why

pivot_longer() — converting wide to long

pivot_wider() — converting long to wide

separate() and unite() — splitting and combining columns

Handling missing data — drop_na(), replace_na(), fill()

Data Visualisation with ggplot2

The grammar of graphics — layers, aesthetics (aes), and geoms

Core chart types

Scatter plots — geom_point()

Line graphs — geom_line()

Bar charts — geom_bar() and geom_col()

Histograms — geom_histogram()

Box plots — geom_boxplot()

Density plots — geom_density()

Faceting — facet_wrap() and facet_grid() for subgroup plots

Customising themes, colours, labels, and legends

Saving publication-quality plots — ggsave() at 300 dpi for thesis

Mod3

Statistical Analysis & Modelling in R

Descriptive stats · Hypothesis testing · Correlation · Regression · ANOVA · R Markdown

Ksh 5,000

5 hours

Descriptive Statistics

Measures of central tendency — mean, median, mode

Measures of dispersion — standard deviation, variance, range, IQR

Frequency tables and cross-tabulations

Skewness and kurtosis — the psych package

Checking normality — Shapiro-Wilk test, QQ plots, histograms

Descriptive statistics by subgroup using dplyr and summarise()

Hypothesis Testing

Formulating null and alternative hypotheses in R

p-values, confidence intervals, and significance levels

One-sample and two-sample t-tests

Independent samples vs paired t-tests

Chi-square test for categorical variables

Non-parametric tests — Mann-Whitney U, Wilcoxon

Correlation & Regression Analysis

Pearson and Spearman correlation

Correlation matrix visualisation — corrplot package

Simple linear regression — lm() and output interpretation

Multiple linear regression — coefficients, R², F-test, p-values

Regression diagnostics — residuals, normality, linearity, homoscedasticity

Logistic regression for binary outcomes — glm()

ANOVA & Reproducible Reports

One-way ANOVA — aov() and result interpretation

Post-hoc tests — TukeyHSD for pairwise comparisons

Two-way ANOVA with interaction effects

Introduction to R Markdown for reproducible research

Generating Word, PDF, and HTML reports from R Markdown

Publication-ready tables with knitr and flextable

In Practice

Common R Commands You Will Learn

Our training is completely hands-on. You write and run real R code in RStudio from day one — directly applicable to your own research datasets.

Data Import & Exploration


# Load the Tidyverse
library(tidyverse)
library(readxl)
library(haven)

# Import from CSV
df <- read_csv("mydata.csv")

# Import from Excel
df <- read_excel("mydata.xlsx")

# Import from SPSS .sav
df <- read_sav("mydata.sav")

# Explore the dataset
glimpse(df)
summary(df)
head(df, 10)

Data Wrangling with dplyr


# Filter, select, mutate with pipe
df_clean <- df %>%
  filter(age > 18) %>%
  select(id, age, income,
         education) %>%
  mutate(
    log_income = log(income),
    edu_group = case_when(
      education < 12 ~ "Primary",
      education < 16 ~ "Secondary",
      TRUE ~ "Tertiary"
    )
  )

# Grouped summary statistics
df %>%
  group_by(edu_group) %>%
  summarise(
    mean_inc = mean(income,
                    na.rm = TRUE),
    n = n()
  )

Visualisation with ggplot2


# Scatter plot with regression line
ggplot(df, aes(x = age,
                y = income)) +
  geom_point(color = "steelblue",
             alpha = 0.6) +
  geom_smooth(method = "lm",
              se = TRUE) +
  labs(
    title = "Age vs Income",
    x = "Age (years)",
    y = "Monthly Income (Ksh)"
  ) +
  theme_minimal()

# Save for thesis (300 dpi)
ggsave("plot.png", dpi = 300)

Statistical Tests & Regression


# Descriptive statistics
psych::describe(df$income)

# Shapiro-Wilk normality test
shapiro.test(df$income)

# Pearson correlation
cor.test(df$age, df$income,
         method = "pearson")

# Multiple linear regression
model <- lm(income ~ age +
    education + gender, data = df)
summary(model)

# One-way ANOVA
fit <- aov(income ~ edu_group,
            data = df)
TukeyHSD(fit)

Why R

Key R Capabilities You Will Master

R’s combination of statistical power, free availability, and exceptional visualisation makes it one of the most valuable tools a researcher in Kenya can learn.

🤷

Completely Free & Open Source

R and RStudio are both 100% free — no licence fees, no annual renewals needed. Install on any computer and use for life at zero cost.

📊

Excellent Data Visualisation

ggplot2 produces publication-quality, highly customisable charts far superior to SPSS or Excel — perfect for theses and journal submissions.

🔧

Powerful Data Wrangling

The dplyr and tidyr packages make data cleaning, filtering, reshaping, and transformation intuitive and fast — even for complex, messy datasets.

📈

Comprehensive Statistical Analysis

From basic descriptive statistics to advanced regression, ANOVA, and multivariate analysis — R covers every technique required for postgraduate research.

📄

Reproducible Research with R Markdown

Combine code, results, and narrative in one document — generating Word, PDF, or HTML reports automatically. Essential for transparent thesis writing.

📦

20,000+ Free Packages

CRAN hosts over 20,000 R packages covering every analytical need — from basic statistics to machine learning and bioinformatics — all completely free.

Is This For You?

Who Should Attend R Software Training?

R is valuable for researchers who want advanced visualisation, greater analytical flexibility, and a tool that will never cost them a licence fee.

🎓

Masters & PhD Students

Need to perform data analysis for your dissertation but want a more flexible, free alternative to SPSS? R fully covers everything required for Chapters 4 and 5 — regression, ANOVA, hypothesis testing, and publication-quality charts.

🌎

Social Science & Development Researchers

R was specifically designed for socio-economic and social science data analysis. Whether analysing household surveys, poverty data, or community health datasets — R handles it all.

📊

Economists & Finance Professionals

Need advanced econometric techniques beyond SPSS? R’s packages for panel data, time-series, and regression modelling give you complete analytical freedom.

🏥

Public Health Researchers

R is widely used for epidemiological analysis, survival analysis, and logistic regression — ideal for public health, clinical research, and health policy dissertations.

🔄

SPSS Users Looking for More Flexibility

Already know SPSS but feeling limited? R offers everything SPSS does — and much more — for free. We cover transitioning your existing SPSS .sav files into R using the haven package from day one.

💼

Data Analysts & Professionals

Need to add R to your professional skill set? Our training equips you with job-ready data wrangling, visualisation, and reporting skills using Tidyverse and R Markdown.

Questions Answered

Frequently Asked Questions

Everything you need to know before enrolling in our R software training programme.

Do I need any programming experience to learn R?

No programming experience is required. Module 1 starts completely from scratch — we cover how to install R and RStudio, understand the interface, and write your very first R commands step by step. The training is designed for researchers and students, not programmers. Many participants have no coding background at all and successfully complete all three modules.

How much does R cost to download and use?

R and RStudio are both completely free to download and use — there are no licence fees, annual renewals, or student discount programmes needed. You simply download R from CRAN and RStudio Desktop from the Posit website, both at no cost. This makes R one of the most financially accessible statistical tools for Kenyan researchers.

Is R accepted in Kenyan university dissertations?

Yes. R is accepted and increasingly encouraged across Kenyan universities for postgraduate data analysis — particularly in economics, social sciences, public health, environmental science, and business research. Its use is growing rapidly as an alternative to SPSS, especially at PhD level where more advanced analytical techniques are required.

Can I take just one module — for example, only Module 2 for ggplot2 and dplyr?

Yes. Each module is independently bookable at Ksh 5,000. If you already know R basics and want to focus on data wrangling with dplyr and visualisation with ggplot2, you can book Module 2 alone. We recommend a working knowledge of R basics before joining Module 2 or 3 independently.

How does R compare to SPSS for research analysis?

SPSS is a point-and-click interface suited to standard statistical tests. R is a full programming language — which means a slightly steeper initial learning curve, but far greater flexibility, power, and zero cost. R’s ggplot2 produces significantly better visualisations than SPSS, it handles more advanced models, and it never costs a licence fee. Many researchers use both — SPSS for quick analysis and R for visualisation and advanced modelling.

Will the training use my own research data?

Absolutely. Bringing your own research dataset makes every exercise directly applicable to your thesis. If you are still collecting data, we will use representative example datasets covering socio-economic, health, and education research contexts.

Is R Markdown covered in the training?

Yes. R Markdown is covered in Module 3. You will learn how to create reproducible research reports combining your R code, analysis results, and written narrative — generating Word, PDF, or HTML documents automatically. This is an extremely valuable skill for thesis writing and increasingly required for journal publication.

Ready to Begin?

Book Your R Software Training Session

Fill in our booking form or contact us directly by phone or WhatsApp. Our team is available Monday to Saturday to confirm your slot and answer any questions.

📋 Book for Training 📞 +254 728 430 728 💬 WhatsApp Us

Get In Touch

Have Questions? Contact Us Directly

We are happy to help you decide which module to start with or discuss how R fits your specific research needs.

📞

Call Us+254 728 430 728

💬

WhatsApp+254 728 430 728

✉

Email Usinfo@tobitresearchconsulting.com

R Software Training in Kenya — Statistical Computing & Data Analysis

R Software Training in Kenya — Statistical Computing & Data Analysis

Course Details

What is R & Why Should Researchers Learn It?

ggplot2

dplyr

tidyr

readr

tidyverse

R Markdown

What You Will Achieve

Training Mode & Approach

Class-Based Training

Practical R Use

Take-Away Assignments

One-on-One Sessions

The Three-Module Learning Path

Introduction to R & RStudio

Data Wrangling & Visualisation

Statistical Analysis & Modelling

Detailed Module Content

Introduction to R & RStudio Basics

Introduction to R & RStudio

R Syntax & Data Types

Data Frames — The Research Dataset in R

Importing & Exporting Data

Data Wrangling & Visualisation with Tidyverse

Data Manipulation with dplyr

Data Reshaping with tidyr

Data Visualisation with ggplot2

Statistical Analysis & Modelling in R

Descriptive Statistics

Hypothesis Testing

Correlation & Regression Analysis

ANOVA & Reproducible Reports

Common R Commands You Will Learn

Key R Capabilities You Will Master

Completely Free & Open Source

Excellent Data Visualisation

Powerful Data Wrangling

Comprehensive Statistical Analysis

Reproducible Research with R Markdown

20,000+ Free Packages

Who Should Attend R Software Training?

Masters & PhD Students

Social Science & Development Researchers

Economists & Finance Professionals

Public Health Researchers

SPSS Users Looking for More Flexibility

Data Analysts & Professionals

Frequently Asked Questions

Book Your R Software Training Session

Have Questions? Contact Us Directly

info@tobitresearchconsulting.com

+254728430728

Copyright ©

Contact Us. We are ready to help you!

Let's have a chat