PMID-sentid	Pub_year	Sent_text	comp_official_name	comp_offset	protein_name	organism	prot_offset
31933987-7	2019	Using machine learning, LASSO selected the subset of variables that minimized the predictive error of the outcome, including CEA, NSE, CYFRA 21-1, CAMKII, tumor size, histologic type, lymph node status, smoking, and age.	alachlor	24-29	calcium/calmodulin dependent protein kinase II gamma	Homo sapiens	147-153