{"componentChunkName":"component---src-templates-card-post-js","path":"/ml-readiness-across-3-data-sources-preclinical-research-clinical-research-and-clinical-practice/","result":{"data":{"site":{"siteMetadata":{"title":"Analog Intelligence"}},"markdownRemark":{"id":"bf9a754f-42bc-559a-b1cc-99191fcef78c","excerpt":"The state of being ML-ready can vary widely across different data types, primarily influenced by their source. The distinction here is between experimental data…","html":"<p>The state of being ML-ready can vary widely across different data types, primarily influenced by their source. The distinction here is between experimental data collected within preclinical or clinical research, and real world clinical data.</p>\n<p>Any data collected within an experimental context - a drug discovery assay or a clinical trial - often come with high levels of ML-readiness. This is a function of inherent control over the experimental design, characteristics of subjects/samples, and potential confounders.</p>\n<p>Real world clinical data, on the other hand, is provided as is. Good luck convincing a physician to write a clinical note in a more standard and structured manner, or getting a technician to change the way a routine CT image is acquired. This translates to low levels of control.</p>\n<p>Readily annotated data enables supervised learning. In experiments, labels are identified a priori and are therefore available from day one. In a real world setting, labels are created by different stakeholders in different locations at different times.</p>\n<p>This requires multiple glue layers to construct a single annotated training sample. It may start with a patient’s MRI from the radiology PACS, linking it to an oncologist’s note in the EHR, and coupling it with a pathologist’s cancer grade from LIMS. </p>\n<p>Working with ML-ready data enables data professionals to spend more time on the actual modeling, storytelling, and visualization. The other end of the spectrum requires considerable data engineering effort before diving into any ML work. </p>","frontmatter":{"title":"ML-readiness across 3 data sources: Preclinical research, clinical research, and clinical practice","number":"008","card":{"childImageSharp":{"fluid":{"base64":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABQAAAALCAYAAAB/Ca1DAAAACXBIWXMAABYlAAAWJQFJUiTwAAACRElEQVQoz6VSyW4aQRTk/6wIK3dk5WDlF6wYkRytXJzlECuSLcWxnWDARgaCbVYNm4bdGYYxMCaEZXpYzKIsUg6typshkIhrDqV+XdNdr15NW7rdLv4HvV5vCWNvMQqmadBXYPAaYZXX/lxenOmsiFuMIiDm4YwLcGdEeHJFuMQClC8tpKUKTmLEJzM4I/6UzuVqDciqimPii9Uqvs1mmE6nGN/fzx0aHd7Tx71wHEeZnInDdBZVErzOF/E6GMK7lIizfAlNvQ99MEBWVvAycIVs4w4TEuvfj6EPR3PBLtk8jidwIqSgagxqT0ODoDEdV7kC9i7DUNpdaMMhfv38ge8kIMpVvLkMwU2OgzcVXBRvEKsof0c+IodvryPwlST4yxIuSp9R/9qeO/Rf4kBI45RGblBDrd9HrqrgFfGCJKPVH6BOBu4IvYXDi2QavkTKDHpGDqbjMYY0mlAs40M4ivNUBr5CmbLNI08ZVuoNfIzE4BaS8FKjc0KEnJoOW60W3C4Xnj11wOFwwGazwWq14lMwiERCwPOdHdi3t/FoYwMP1tZwsL8PWZbxYncXDrsdjzc38XB9HU+2tuaCnU6HFwoFHo1GeSQS4V6vl3s8Hi5JEq8qCo/H4ybv8/u50+nkoijyZrPJhUSCh8Nh7g8EuMvl4qFQiJMgN9+hrusYUuij0QiTycSEwTHGlvyYYjCex4Ci0CgaY/2XN2rzL1M3ptRq7Pb2ltXrdXOt0d6AsV/UBhRFYaqqMrrIaLIl2u22uZI59hvN7/Dc/AiP2QAAAABJRU5ErkJggg==","aspectRatio":1.7699115044247788,"src":"/static/69bcc715ec13c1c081244db479b3494b/bc8e0/008.png","srcSet":"/static/69bcc715ec13c1c081244db479b3494b/8ac63/008.png 200w,\n/static/69bcc715ec13c1c081244db479b3494b/3891b/008.png 400w,\n/static/69bcc715ec13c1c081244db479b3494b/bc8e0/008.png 800w,\n/static/69bcc715ec13c1c081244db479b3494b/6050d/008.png 1200w,\n/static/69bcc715ec13c1c081244db479b3494b/00405/008.png 1600w,\n/static/69bcc715ec13c1c081244db479b3494b/9e50a/008.png 2401w","srcWebp":"/static/69bcc715ec13c1c081244db479b3494b/ccdb5/008.webp","srcSetWebp":"/static/69bcc715ec13c1c081244db479b3494b/6b183/008.webp 200w,\n/static/69bcc715ec13c1c081244db479b3494b/fc32b/008.webp 400w,\n/static/69bcc715ec13c1c081244db479b3494b/ccdb5/008.webp 800w,\n/static/69bcc715ec13c1c081244db479b3494b/9000d/008.webp 1200w,\n/static/69bcc715ec13c1c081244db479b3494b/e0075/008.webp 1600w,\n/static/69bcc715ec13c1c081244db479b3494b/c351a/008.webp 2401w","sizes":"(max-width: 800px) 100vw, 800px"}}}}}},"pageContext":{"slug":"/ml-readiness-across-3-data-sources-preclinical-research-clinical-research-and-clinical-practice/","previous":{"fields":{"slug":"/promises-ai-artificial-intelligence-small-molecule-drug-discovery/"},"frontmatter":{"title":"The promises of AI in small molecule drug discovery","type":"card"}},"next":{"fields":{"slug":"/flagship-pioneering-annual-letter-2022-four-forces-shaping-biotech/"},"frontmatter":{"title":"Flagship pioneering: Four forces shaping biotech","type":"card"}}}},"staticQueryHashes":["2841359383"]}