DISCUSS SAMPLE DISTRIBUTIONS
The sample size is 580, which is quite a large sample. As this sample
size is quite large it appears to be a good representation of the whole
population. This data was collected from a sample of VLBW babies born during
the 1980's in the USA.
The sample of 580 birth weights of VLBW babies represents how I would have
predicted the whole population of birth weights of VLBW babies to look and so I
am comfortable to proceed with the analysis based on this sample.
Looking at the sample of 581 VLBW baby weights, we see an approximately
normal distribution, with a little bit of a left skew.
The median birth weight of VLBW babies that do not survive is 850g
which is 350g lighter than the median birth weight of VLBW babies that do
survive (which is 1200g). As the sample size is 581 which is a good
representation of the population of VLBW babies born in America in the 1980s it
appears my theory that VLBW babies tend to be more likely to survive when they
are heavier is true.
For this sample we can see from the graph that the median birth-weight
for infants that survived is quite a bit higher (170g higher) than the upper
quartile of birth-weight of VLBW babies which is approximately 1030g The upper
quartile of VLBW babies that survived is substantially higher than the whole
middle 50% of VLBW babies that didn't survive. The upper quartile of VLBW
babies that survived lies at 1350g which is shifted further up the scale by
320g than the middle 50% of VLBW babies that died. This again enforces my
belief that the lower the birth weight of VLBW babies, the higher the risk of
death.
For this sample the distribution for the VLBW babies that did survive
is skewed to the left, whilst the distribution for the VLBW babies that did not
survive is only partially skewed to the right. This appears to be because the
data for the VLBW babies that did not survive is very evenly spread whilst the
data for the VLBW babies that did survive seems to take a bell shape with the
majority of the data between 1000-1500g. This is relatively close to what I had
expected the shape of the data to look like and again seems to prove that VLBW
with lower birth weights are less likely to survive that VLBW babies with
higher birth weights,
In this sample there is a lot less data provided for the VLBW babies that
did not survive as opposed to the amount of data provided for the VLBW babies
that did survive. It would be beneficial to take a different sample with a more
even amount of data each way so we can fairly compare the birth weights of VLBW
babies that did survive and VLBW babies that did not survive.
STEP 4: DISCUSS SAMPLING VARIABILITY, INCLUDING VARIABILITY OF
ESTIMATES
Based on this sample estimate it appears that the difference between
the median birth weight of VLBW babies that survive and VLBW babies that do not
survive is 350g. However it must be recognized that thing conclusion is drawn
from just one sample and could be due to chance alone. In order to refine this,
multiple samples would need to be taken and analysed.
This problem is called sampling variability and is an issue commonly
faced when using a sample statistic to estimate a population parameter.
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.