PMID-sentid	Pub_year	Sent_text	comp_official_name	comp_offset	protein_name	organism	prot_offset
8002930-5	1994	The overall composition of the deduced amino acid sequence matched that expected for a mucin protein core and is rich in serine, threonine, proline, glycine and alanine (approximately 51%).	Proline	140-147	LOC100508689	Homo sapiens	87-92
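
The two offset columns appear to be zero-based character spans into Sent_text with an exclusive end: 140-147 covers "proline" and 87-92 covers "mucin". The sketch below checks that reading on this one record. It assumes the fields are tab-separated (the delimiter is not visible in the record itself), and the helper names `load` and `span` are hypothetical, introduced only for illustration.

```python
import csv
import io

# The single record from the table, rebuilt with tab separators.
# Tab separation is an assumption; the original delimiter is not shown.
SENT = (
    "The overall composition of the deduced amino acid sequence matched "
    "that expected for a mucin protein core and is rich in serine, "
    "threonine, proline, glycine and alanine (approximately 51%)."
)
HEADER = [
    "PMID-sentid", "Pub_year", "Sent_text", "comp_official_name",
    "comp_offset", "protein_name", "organism", "prot_offset",
]
VALUES = [
    "8002930-5", "1994", SENT, "Proline",
    "140-147", "LOC100508689", "Homo sapiens", "87-92",
]
RAW = "\t".join(HEADER) + "\n" + "\t".join(VALUES) + "\n"


def load(raw):
    """Parse tab-separated text into a list of dicts keyed by column name."""
    reader = csv.DictReader(
        io.StringIO(raw), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    return list(reader)


def span(text, offset):
    """Slice text by a 'start-end' offset (zero-based, end exclusive)."""
    start, end = (int(part) for part in offset.split("-"))
    return text[start:end]


row = load(RAW)[0]
compound = span(row["Sent_text"], row["comp_offset"])
protein = span(row["Sent_text"], row["prot_offset"])
print(compound, protein)  # prints: proline mucin
```

Slicing with the offsets rather than searching for the name matters here because the mention in the sentence ("mucin") differs from the value in protein_name (LOC100508689), so a string search on the official name would find nothing.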