Category Archives: Gene Expression

Single-cell multi-omics using generative AI

Generative pre-trained models are becoming more common by the day, and such methods have now been applied to single cell sequencing, and multi-omits in general. Two main players are “scBERT” and “scGPT”: scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AIHaotian Cui, Chloe Wang, Hassaan Maan, Bo Wang – Nature Methods doi: 10.1038/s41592-024-02201-0 CODE: https://github.com/bowang-lab/scGPT scBERT as a large-scale pretrained deep language… Read More »

YOUai.ai : mini-AI engines without coding

AI is at the forefront of Society changes. We are at the dawn of a new world. Let’s hope that it will NOT be a “Brave New World“. Today I discovered (through a LinkedIn post by the group “Generative AI” which has 1.3M followers.) This brand new AI is called “youai.ai” with mostly free “mini-AI”… Read More »

Bioinformatics Training Materials

The Bioinformatics Core at the UC Davis Genome Center has all the Documentation for the workshops and courses, past and current, available on the bioinformatics training program GitHub page. The Genome Center Event Registration offers the same material in a different listing order based on the date. Most of the material is available as Github… Read More »

Next Gen sequencing in the Gobi desert

This video from Oxford Nanopore channel is showing how they use Nanopore on site in the desert, stating that “the future is already here” at time 3min40sec within the video. In the Gobi Desert, a team established a mobile lab for genome sequencing.Their objective was to investigate the microbiome of tiny mammals residing in this… Read More »

Big Book of R

www.bigbookofr.com by Oscar Baruffa This online book is a compendium list of ~300 books using R for data analysis. Fro example, one useful example contained within is Computational Genomics with R by Altuna Akalin (2020-09-30.) The big book is available on GitHub as BigBookofR. If you take time to check the site and file 020-book_list.Rmd… Read More »

When We Met Other Human Species

In the light of the new Nobel Prize nomination of Svante Pääbo it is nice to watch these youtube titles. The first one is from PBS, and offers a nice summary without too much scientific jargon or baggage. The next title is presented by Svante Pääbo that was the presentation that I saw at the… Read More »

PCA as Metro-Maps & Hierarchical Clustering on Principal Components

The iris dataset is perhaps one of the most famous datasets used to learn and teach statistics and now machine learning. Being curious about this dataset lead me last time to the TableConvert.com  web site that I discussed in my previous post here (and there.) “Metro Maps” Today while revisiting the Wikipedia link for this dataset (Iris_flower_data_set) my… Read More »

Hunting for SRA sequence archives

SRA: Sequence Read Archive The Sequence Read Archive (SRA) makes biological sequence data available to the research community to enhance reproducibility and allow for new discoveries by comparing data sets. The SRA stores raw sequencing data and alignment information from high-throughput sequencing platforms, […] However, it is rather difficult to even find the download links… and even… Read More »

omictools.com: Search engine for biological data analysis

Re: omictools.com Finding software that is relevant to any biological analysis can be inspired by reading a paper or perhaps searching within Google. The web site https://omictools.com/ contains 16,971 software (omic) tools that are organized in categories shown on the home page. Checking a bit further one can find interesting entries such as: High-throughput sequencing data analysis…… Read More »