What is predict() in R?
With the predict()
function in R, you can make predictions for new, unseen data. This function is an important tool for machine learning.
What is predict() in R used for?
The R function predict()
is a versatile tool used in predictive modeling. It generates predictions for new or existing data points based on a previously designed statistical models, such as a linear regression, logical regression, decision trees and other modeling techniques.
What is the syntax for predict() in R?
R’s predict()
function takes as arguments a trained model and the data points that the prediction should apply to. You can specify different options and parameters based on the type of model used. The result is a vector of predictions that can be useful for various analytical purposes, including evaluating the performance of a model, decision making or illustrating the resulting data.
object
: The trained model that the predictions are applied tonewdata
: The data point for the predictioninterval
: Optional argument for entering the type of confidence interval (confidence
for mean interval,prediction
for predictions)
Example of how to use predict() in R
The following example will illustrate how the predict()
function in R works. We’ll use a user-defined data set with speed and distance values.
Creating and displaying data
First, we’ll create a user-defined data set for evaluating the relationship between speed and distance. We’ll use the function data.frame()
to create a data frame and then define the values for the variables speed
and distance
as c(15, 20, 25, 30, 35)
and c(30, 40, 50, 60, 70)
respectively.
After we’ve created the data set, we’ll display it using the print()
function. That way we can check the structure and the assigned values of our new data frame.
Output:
Creating a linear model
Output:
In the output, we see a linear model (custom_model
) that was generated for the data set and models the relationship between speed and distance. We get the result of the model, including coefficients and statistical information.
Defining new speed values and making predictions
We’ve now created another data set (new_speed_values
) with new values for speed. Then we used R predict()
to make predictions for the corresponding distance values using the linear model we created above.
Displaying the predictions
The output shows the distance values predicted based on speed:
If you want to learn about processing strings for text manipulation and data cleaning in R, take a look at our tutorials on R gsub and sub and R substring.
- 99.9% uptime
- PHP 8.3 with JIT compiler
- SSL, DDoS protection, and backups