par(mfrow = c(2,2))
curve(x/(1+x), 0, 10)
curve(1 - exp(-x), -100, -90)
curve(exp(x)/(1+exp(x)), -5, 5)
curve(x * exp(x), 95, 100)
Non-linear Regression
Dr. Peng Zhao (✉ peng.zhao@xjtlu.edu.cn)
Department of Health and Environmental Sciences
Xi’an Jiaotong-Liverpool University
1 Learning objectives
- Understand what is non-linear regression
- Understand how non-linear regression is performed
- Evaluate non-linear regression models
- Choose possible non-linear models
2 Principle
2.1 Definition
- Non-linear regression:
-
A model for the relationship between the dependent variable(s) and the independent variable(s) is not linear but a curve
Two categories:
1. One that can be transformed into a linear model
2. One that cannot be transformed into a linear model
Examples:
: Set , then . : Set , then : Set and , then . : cannot be transformed into linear.
Name | Equation |
---|---|
Asymptotic functions | |
Michaelis-Menten | |
2-parameter asymptotic exponential | |
3-parameter asymptotic exponential | |
S-shaped functions | |
2-parameter logistic | |
3-parameter logistic | |
4-parameter logistic | |
Weibull | |
Gompertz | |
Humped curves | |
Ricker curve | |
First-order compartment | |
Bell-shaped | |
Biexponential |
2.2 Model
: The response variable. : The explanatory variable. : Unknown parameter vector. : Random error term.
Assumptions:
: The mean value of random error term is 0 , where : No correlation. , where : Equal variance- The explanatory variable is non-random variable
is assumed to be a continuous differentiable function
2.3 Methods
Non-linear least squares (NLS):
Partial coefficient of determination: Measure the fitting of non-linear regression model.
3 Workflow
3.1 Data
Reaction rate ~ concentration
<- read.table("data/mm.txt", header = TRUE)
dtf library(ggplot2)
ggplot(dtf) + geom_point(aes(conc, rate))
3.2 Fit the Model
Michaelis-Menten model
- Biochemistry: the relationship between the reaction rate and the substrate concentration
- Ecology: (Holling’s disc equation) the relationship between the feeding rate of predator and the prey density.
Features:
- The curve passes through the origin.
- Rising rate of the curve diminishes with the increasing of x.
- Function has an asymptote
. when .
<- nls(rate ~ a * conc / (b + conc), data = dtf, start = list(a = 200, b = 0.03))
m1
m1summary(m1)
3-parameter asymptotic exponential model
<- nls(rate ~ a - b * exp(-c * conc), data = dtf, start = list(a = 200, b = 150, c = 5))
m2
m2summary(m2)
<- seq(0, 1.2, .01)
xv <- predict(m1, list(conc = xv))
y1 <- predict(m2, list(conc = xv))
y2 <- data.frame(xv, y1, y2)
dtf_predicted ggplot(dtf) +
geom_point(aes(conc, rate)) +
geom_line(aes(xv, y1), data = dtf_predicted, color = 'blue') +
geom_line(aes(xv, y2), data = dtf_predicted, color = 'red')
Partial coefficient of determination:
<- function(model) {
calc_R2 <- summary(model)
ms <- as.vector((ms[[3]])^2 * ms$df[2])
sse <- lm(dtf$rate ~ 1)
null <- as.vector(unlist(summary.aov(null)[[1]][2]))
sst 1 - sse/sst
}
calc_R2(m1)
[1] 0.9612608
calc_R2(m2)
[1] 0.9672987
3.3 Results
The Michaelis-Menten model can explain 96.1% of the total variation in the reaction rate, while the 3-parameter asymptotic exponential model can explain 96.4%.
3.4 Automatic starting values
<- nls(rate ~ SSmicmen(conc, a, b), data = dtf)
m3 summary(m3)
<- nls(rate ~ SSasympOff(conc, a, b, c), data = dtf)
m4 summary(m4)
Function | Model |
---|---|
SSasymp() |
asymptotic regression model |
SSasympOff() |
asymptotic regression model with an offset |
SSasympOrig() |
asymptotic regression model through the origin |
SSbiexp() |
biexponential model |
SSfol() |
first-order compartment model |
SSfpl() |
four-parameter logistic model |
SSgompertz() |
Gompertz growth model |
SSlogis() |
logistic model |
SSmicmen() |
Michaelis–Menten model |
SSweibull() |
Weibull growth curve model |