Predicting birthweight based on maternal risk factors (34 points). In this problem, we will consider predicting the birthweight (birthwt) of newborns in milligrams, using the age of the mother at the time of delivery (age), the weight of the mother at the last menstrual period (lwt) in kilograms, and the mother’s race (race). For maternal race, we have three levels: “white,” “black,” and “other.” This type of analysis is of clinical interest to obstetricians and obstetrics researchers who want to study the risk factors for low birth weight (birth weight less than 2.5 kg), which is a common adverse pregnancy outcome. The dataset birthweight.csv contains this data on 𝑛 = 189 randomly selected deliveries.

a. (2 points) Download the dataset birthweight.csv and read it into R. In order to register the race column as a factor (categorical variable), please use the following R code:

birth.dat <- read.csv(“birthweight.csv”, header=T, stringsAsFactors = T)

Next, make white the baseline (or reference) group for race with the following line of code:

birth.dat$race <- relevel(birth.dat$race, ref=”white”)

Finally, fit a multiple linear regression model to this data, with birthwt as the response variable. What is the fitted equation for our model, and what is the adjusted 𝑅2 for this model?

b. (2 points) We know that the patients in this study are independent, so the independent errors assumption is reasonable. Use diagnostic plots to check the other model assumptions for linear regression and check for outliers. Include these plots in your homework submission. Are the assumptions met, and are there any clear influential points?
c. Now we will conduct inference for our model.

i. (1 point) What are the F-statistic, p-value, and conclusion of the F- test for our model?

ii. (3 points) What are the interpretations for the estimates of the non- intercept regression coefficients in the context of this problem?

iii. (2 points) What are the t-statistics, p-values, and conclusions of the t-tests for the non-intercept regression coefficients?

iv. (3 points) What are the confidence intervals for the non-intercept regression coefficients in the context of this problem? Give their interpretation in the context of this problem.

d. (2 points) Conduct a marginal analysis of the association between age and birthweight. Is age marginally associated with birthweight? Explain.

Instant Solution Available for $5

Chat with us or submit your question here.

Related Questions and Answers

Predicting health care charges of insurance beneficiaries (30 points). Health insurance companies often have to cover a sizable portion of the health care costs of their primary beneficiaries. Thus, it is of practical interest for these companies to predict the cost of medical bills from individual patient characteristics. In this problem, we will consider predicting the medical costs (or charges) in dollars for 𝑛 = 1338 primary beneficiaries

Check Solution »

Predicting home sales prices from square footage (36 points). In this problem, we will consider predicting house sale prices (SalePrice) from square footage (SqrFeet) using a dataset of 𝑛 = 506 houses.

Check Solution »

A prime number (or a prime) is a natural number that has exactly two distinct natural number divisors: 1 and itself. The purpose of this problem is to write a function (say its name is check.prime) to check whether or not a given natural number is a prime. Unless you want to use some other more advanced method, you can write your function based on the so-called “trial division” method. The idea is as follows. For a positive integer

Check Solution »

Consider the following contingency table: What is the probability of event
a. D′?
b. D and C
c. D′ and C′?
d. D′ or C′?

Check Solution »

A box contains 14 red pens and 10 green pens. A pen is to be selected at random. Give an example of a simple event.

Check Solution »

Three coins are tossed.
a. Give an example of a simple event.
b. Give an example of a joint event.
c. What is the complement of a head?

Check Solution »

A local public-action group solicits donations by telephone. For a particular list of prospects, it was estimated that for any individual, the probability was .05 of an immediate donation by

Check Solution »

A manager has available a pool of eight employees who could be assigned to a project-monitoring task. Four of the employees

Check Solution »

Each year, ratings are compiled concerning the performance of new cars during the first 90 days of use. Suppose that the cars have been categorized

Check Solution »

Share this question:

Facebook
Twitter
Pinterest
LinkedIn
WhatsApp

Get Step-by-Step Solutions

Experience expert help with your homework
RECENT REVIEWS
Kimberly
Kimberly
Statistics
Read More
Excellent work. Meet my expectations. Thanks.
John
John
Math
Read More
" Learnmathstat.com " is a name that MUST remember when you have a project in mathematics, even if that project is related to an advanced course!
Eva
Eva
Algebra
Read More
Very professional, high quality, and always delivers on time.
Previous
Next
Scroll to Top