Can Machine Learning Really Detect Lung Cancer?

Artificial Intelligence (AI) has been such an intriguing topic over the last decade that even Elon Musk and Mark Zuckerberg recently debated it. A key part of AI is Machine Learning (ML), which allows a machine to learn to think the way we want it to. Although ML and related research have existed for almost fifty years, only in the 21st century have we started exploring its diverse potential to solve human problems. One of the blessings it has given us so far is in the area of bioinformatics.

Severity of Cancer Diagnosis

Every cancer is unique in its own way. Despite this, researchers are continuously trying to identify its causes to make prevention possible. Detection of cancer is as important as prevention, because early diagnosis gives a higher chance of survival. For instance, a lung cancer patient with localized cancer has a 55% survival rate, whereas if the tumor spreads to other organs the chance drops to 4%. However, only 16% of lung cancer cases are diagnosed at an early stage. Moreover, in 2014, almost 14.5 million people in the US were beyond the reach of cancer diagnosis. The rest of the world, especially underdeveloped and developing countries, lags even further behind in having such facilities. The reasons include the need for high-quality equipment and expert physicians to diagnose cancer. This is why researchers are trying to incorporate AI and ML algorithms to make the diagnosis process simpler while achieving remarkable accuracy.

Recent Developments Through ML

In 2017, Kaggle, the most popular data science learning and competition platform, hosted the Data Science Bowl, featuring a cancer diagnosis problem. It released CT scan images to the public and asked data scientists to come up with a machine learning model that better predicts the probability of lung cancer. In return, Kaggle offered its highest prize money to date, valued at one million US dollars. The competition ran for three months, from February 2017 to April 2017, and almost two thousand teams from across the globe participated. After the competition ended, a new initiative, Concept To Clinic, was launched to develop an open source solution for clinics. This system will enable radiologists to access the solution developed by the researchers through SaaS. Concept To Clinic is funded by The Bonnie J. Addario Lung Cancer Foundation, whose aim is to make lung cancer a chronically managed disease within the next five years. In pursuit of this challenging vision, development of the system has already begun, and it was open sourced in the first week of August 2017 for engineers and data scientists. The initiative also offers monetary rewards based on the contribution one makes. Total prize money of 100,000 US dollars has been announced for top contributors.

Behind the Scenes

Detection of lung cancer happens in two major steps: 1) creating an ML model and 2) using the model to predict the cancerous regions of the lungs. To create, or train, an ML model, researchers feed in CT scan DICOM image files along with attributes such as tumor size and malignancy information. Using that information, normal or irrelevant regions are ignored. To identify the abnormal regions, an ensemble of multiple statistical models is used. The model is checked and cross-validated on test data that was not used in training. Once the model has been developed and reaches satisfactory accuracy, it is used to predict the probability of a cancerous tumor.
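
The two steps above can be sketched in a few lines of Python. This is only a minimal illustration on synthetic numbers, not the method used by any competition entry. The two features (nodule diameter and density), the helper names, and the simple one-feature threshold models are all assumptions made for the example. Real systems work on the CT image data itself with far richer models.

```python
import random

def make_synthetic_nodules(n, seed=0):
    """Generate toy (features, label) pairs. NOT real clinical data."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        malignant = rng.random() < 0.5
        # Illustrative assumption: malignant nodules are larger and denser on average.
        diameter_mm = rng.gauss(18.0 if malignant else 8.0, 3.0)
        density = rng.gauss(0.7 if malignant else 0.4, 0.1)
        rows.append(((diameter_mm, density), int(malignant)))
    return rows

def train_stump(rows, feature_index):
    """Step 1: fit a one-feature model with its threshold midway between class means."""
    pos = [x[feature_index] for x, y in rows if y == 1]
    neg = [x[feature_index] for x, y in rows if y == 0]
    mean_pos = sum(pos) / len(pos)
    mean_neg = sum(neg) / len(neg)
    threshold = (mean_pos + mean_neg) / 2
    half_gap = (mean_pos - mean_neg) / 2 or 1.0  # avoid division by zero

    def predict_proba(x):
        # Map the signed distance from the threshold to a score in [0, 1].
        z = (x[feature_index] - threshold) / half_gap
        return min(1.0, max(0.0, 0.5 + 0.5 * z))

    return predict_proba

def ensemble_proba(models, x):
    """Step 2: the ensemble prediction is the average of the individual models."""
    return sum(model(x) for model in models) / len(models)

def accuracy(models, rows):
    """Fraction of rows where the thresholded ensemble score matches the label."""
    correct = sum(int(ensemble_proba(models, x) >= 0.5) == y for x, y in rows)
    return correct / len(rows)

data = make_synthetic_nodules(400, seed=42)
train, test = data[:300], data[300:]          # the test rows are never used in training
models = [train_stump(train, i) for i in range(2)]
held_out_accuracy = accuracy(models, test)
print(f"held-out accuracy: {held_out_accuracy:.2f}")
```

Averaging several weak models is one of the simplest forms of ensembling. Evaluating on rows that were held out of training is what gives the accuracy figure any meaning.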

How It Impacts the Human Race

According to the SEER report, lung cancer is the deadliest cancer, killing more people than breast, colon, and prostate cancer combined. Every 3.3 minutes, someone in the U.S. dies of lung cancer. Currently, a CT scan, which is a 3D image of the lungs, is used to detect possibly cancerous regions. These scans are then carefully examined by trained radiologists. This procedure requires not only expert eyes but also labs with proper equipment. Even so, it produces a high rate of false positives: people with no cancer may be treated for cancer unnecessarily, which is not only an economic burden but also psychologically stressful for both the patient and their family.

Prediction through ML can make this diagnosis process much simpler and more reliable. Although the current statistics on cancer diagnosis are troubling to health experts, results from research and development in this area offer rays of hope.

My First Step towards Bioinformatics

Bioinformatics is a rising field that combines multiple disciplines, including Biology, Computer Science, Psychology, Mathematics and Statistics, and Chemistry.

I have a keen interest in Biological Science. I took Biology courses in high school and did quite well in them. But when I had to choose my undergrad major, I chose Computer Science because I was more interested in it. There was no degree in Bioinformatics offered in our country. To be frank, I did not plan to move to Bioinformatics during my undergrad years either. My goal was to learn as much as possible across diverse areas of Computer Science. When I was exposed to AI and Machine Learning, I decided to do my MS in this area, and so I chose to major in Intelligent Systems. During my MS, I started to develop an interest in Bioinformatics.

Since there were no courses directly related to Biological Science or Bioinformatics, I started to read on my own. I began with human cognition and the development of AI cognition. The TV series Westworld encouraged me in this particular direction. Eventually I searched through a lot of content on Google. Among the results was a list of Cognitive AI projects on Wikipedia, which also contains other interesting AI projects. Among them, OpenCog caught my interest. It seemed like a nice framework for developing AI systems.

One day I was reading some articles on the attributes that define and control human psychology. I tried to understand the function of hormones like Estrogen, Testosterone, Serotonin and so on, which are responsible for various activities in our body and mind, and how these hormones are controlled by our Endocrine System. Around that time, I found some competitions on Kaggle that required solving problems related to various types of cancer with the help of machine learning and neural networks. I was not well equipped with the knowledge required to solve those problems. I knew ML and NN algorithms, but I felt that if I knew more about those problems, I might discover more of the interesting research problems that exist in the world today.

Following my curiosity, I started digging into cancer and cancer mutation. I started reading about our most basic components, such as DNA, RNA and proteins. I tried to understand how mutation works, what happens when it fails, and how cancer cells affect normal mutation. In the future I will try to write a separate article on cancer genomics, covering some of the aspects that fascinated me and the resources that helped me.
