Portfolio
Featured Projects
Deciphering Protein Sequence Relationships: A Python Based Approach
- Utilized Python scripting and bioinformatics libraries (e.g., Biopython) to analyze protein.
- Gained proficiency in sequence alignment, phylogenetic tree construction, and structure prediction using BLAST, INTERPROSCAN, SWISS-MODEL.
- Identified key functional domains and structural motifs in proteins using Python scripting and Biopython, contributing to a deeper understanding of protein function and evolution.
NLP-Driven Gene Function Prediction using BERT in Biological Sequences
- Working on a gene function prediction using NLP (BERT model) on biological sequences, focusing on TensorFlow.
- Using BERT, we train on biological sequences to identify gene regions, predicting functionality in novel sequences with NLP, TensorFlow, and robust algorithms.
- Automating gene identification and characterization with BERT can significantly advance gene function understanding for genomic research and applications.
Highly accurate protein structure prediction with Alphafold
-
Leveraged AlphaFold AI and ColabFold to explore protein folding, employing advanced predictive analysis and visualization tools (e.g., PyMOL) for structural assessment.
-
Demonstrated understanding of cutting-edge deep learning models in protein structure prediction and their significance in decoding complex protein architectures.
-
Utilized cutting-edge AlphaFold AI and ColabFold to predict protein structures with high accuracy (e.g., pLDDT > 90), illuminating potential applications in structural biology, drug discovery, and collaborative research.
Data Analysis on the prevalence of Anemia and its factors in Pregnant and Non-Pregnant Women.
-
Performed comprehensive statistical analysis of anemia prevalence in women using WHO datasets and R/Python.
-
Analyzed hemoglobin concentration distribution through box plots, confusion matrices, and histograms, identifying potential biases and disparities.
Featured Reports
Exploring Gene Expression Changes in Prostate Cancer Cells Under Hypoxia
In this project, I explore differential gene expression and pathway enrichment in two prostate cancer cell lines—LNCaP and PC3—under different oxygen conditions: hypoxia (low oxygen) and normoxia (normal oxygen). Hypoxia is a common feature of tumor environments and...
Understanding Genome Assembly Using De Bruijn Graphs: From Concepts to Code
Genome assembly is a key concept in bioinformatics, and it involves reconstructing a long DNA sequence from short, overlapping fragments known as reads. The purpose of this article is to explain how we can use De Bruijn graphs and Eulerian paths to solve genome...
Different types of Bioinformatics data
Bioinformatics data encompasses various types, each crucial for advancing our understanding of biological systems and enhancing medical research. Bioactivity data reveals how compounds interact with biological systems, essential for drug discovery and safety...