丢弃 200 个随机健康实例。如何在 Rstudio 中实现这一点?
这是数据框:
https://www.kaggle.com/code/jamaltariqcheema/model-performance-and-comparison/data
我试过这个,但我得到了一个错误。
kidney_disease$hd <- ifelse(test=kidney_disease$hd == 0, yes="Healthy", no="Unhealthy")
回答1
也许以下解决了问题的问题。
用sample
随机选择行号,将默认value "Healthy"
分配给新列hd
并分配value "Unhealthy"
到随机选择的行。
set.seed(2022) # Make results reproducible
i <- sample(nrow(kidney_disease), 200)
kidney_disease$hd <- "Healthy"
kidney_disease$hd[i] <- "Unhealthy"