Portfolio
Featured Projects
Deciphering Protein Sequence Relationships: A Python Based Approach
- Utilized Python scripting and bioinformatics libraries (e.g., Biopython) to analyze protein.
- Gained proficiency in sequence alignment, phylogenetic tree construction, and structure prediction using BLAST, INTERPROSCAN, SWISS-MODEL.
- Identified key functional domains and structural motifs in proteins using Python scripting and Biopython, contributing to a deeper understanding of protein function and evolution.
NLP-Driven Gene Function Prediction using BERT in Biological Sequences
- Working on a gene function prediction using NLP (BERT model) on biological sequences, focusing on TensorFlow.
- Using BERT, we train on biological sequences to identify gene regions, predicting functionality in novel sequences with NLP, TensorFlow, and robust algorithms.
- Automating gene identification and characterization with BERT can significantly advance gene function understanding for genomic research and applications.
Highly accurate protein structure prediction with Alphafold
-
Leveraged AlphaFold AI and ColabFold to explore protein folding, employing advanced predictive analysis and visualization tools (e.g., PyMOL) for structural assessment.
-
Demonstrated understanding of cutting-edge deep learning models in protein structure prediction and their significance in decoding complex protein architectures.
-
Utilized cutting-edge AlphaFold AI and ColabFold to predict protein structures with high accuracy (e.g., pLDDT > 90), illuminating potential applications in structural biology, drug discovery, and collaborative research.
Data Analysis on the prevalence of Anemia and its factors in Pregnant and Non-Pregnant Women.
-
Performed comprehensive statistical analysis of anemia prevalence in women using WHO datasets and R/Python.
-
Analyzed hemoglobin concentration distribution through box plots, confusion matrices, and histograms, identifying potential biases and disparities.
Featured Reports
Bioinformatics essential file formats and functions
In the rapidly evolving field of bioinformatics, understanding the different file formats is crucial for data analysis and research. Here's a quick guide to some of the most commonly used bioinformatics files
A Practical Guide to Interpreting Proteomics Data Visualization
Data visualization plays a crucial role in analysis by bridging the gap between data and results. Mass spectrometry facilitates an in-depth examination of complex proteomes. Given the widespread availability of mass spectrometry, we are now faced with a vast amount of...
Genetic and Health Record Dataset: Insights and Translations
This summary report is based on the topic Biological and translational insights from large scale genetic and electronic health record dataset by professor Dajiang Liu.