Skip to contents

Data set taken from (Crawley 2012) and posteriorly analyzed by (Lemonte et al. 2020) . The data includes the count of infected blood cells per square millimetre on microscope slides prepared from n = 511 randomly selected individuals.

Format

A data frame with 511 rows and 5 variables:

  • cells: count of infected blood cells per square millimetre on microscope slides

  • smoker: smoking status of the subject (0: smoker; 1: non smoker)

  • gender: subject's gender (1: male; 0: female).

  • age: subject's age categorized into three levels: young (\( \le 20\)), mid (21 to 59), and old (\(\ge 60\)).

  • weight: body mass score categorized into three levels: normal, overweight, obese.

References

Crawley MJ (2012). The R Book, 2nd edition. Wiley Publishing. ISBN 0470973927.

Lemonte AJ, Moreno-Arenas G, Castellares F (2020). “Zero-inflated Bell regression models for count data.” Journal of Applied Statistics, 47(2), 265-286. doi:10.1080/02664763.2019.1636940 .