Category Archives: Computation

DataChat: Guided English Language© Ai and ML data analysis

A new spin-off company from the University of Wisconsin-Madison proposes a new interface for data analysis based on Guided English Language© (GEL) to command Artificial Intelligence and Machine Learning methods via “utterances” i.e. English-like phrases easier to write than code whether R, python, SQL, or other traditional tools. The web site is at The… Read More »

Computing residues per turn in an alpha helix

Last month I wrote a blog on my Biochemistry Blog about computing the number of residues within an alpha helix. This was in relation to a project concerning predicting a “coiled-coil” structure. It was more complicated than I thought. The link to the original article is here: Computing residues per turn in an alpha helix… Read More »

Download Google Drive Files using wget

Cross-posting from post:  Download Google Drive Files using wget The problem: transfer file between clouds Files on Google drive can be shared between users, but the default access to the file is via we web browser graphical interface. However, sometimes it may be useful, or even necessary, to access and download a file from a command line,… Read More »

Default preference reversal in R 4.0.x

I just wrote an entry on my Biochem blog which I think would fit on this site: Default preferences I enjoy using R and RStudio, but I am always weary of upgrading R because that usually leads to some issue(s). The most recent was a bit long for me to diagnose, even though in retrospect… Read More »

in2csv: the Excel killer is part of csvkit the command-line spreadsheet

Summary csvkit is a suite of command-line tools for converting to and working with CSV, the king of tabular file formats. (csvkit can convert XLSX files to CSV.) A good docker container for csvkit: thomasleplus/csv Inspiration: stack overflow article: convert-xlsx-file-to-csv-using-batch Note: I wrote a different version of this on this post elsewhere: csvkit command-line spreadsheet can convert and… Read More »

A great Unix tutorial suitable for beginners

More than once I have written my own tutorials for learning how to use the command line within the bash shell in the context of a Unix or Unix-like computer. Recently I found a very well done tutorial that, while titled “Unix for Neuroimagers”  provides a wonderfully well done, easy to understand set of small… Read More »

PCA as Metro-Maps & Hierarchical Clustering on Principal Components

The iris dataset is perhaps one of the most famous datasets used to learn and teach statistics and now machine learning. Being curious about this dataset lead me last time to the  web site that I discussed in my previous post here (and there.) “Metro Maps” Today while revisiting the Wikipedia link for this dataset (Iris_flower_data_set) my… Read More »