Version control is one of the great skills that a software developer should have. Though there is a lot to learn, the basics are readily available on the internet to learn. Commands are well docume...
Scheduling Tasks on Linux with CRON
We have the ability to create processes to simplify our tasks immensely as programmers. Able to turn whole paragraphs of text (a script) into a few words (a command) gives great flexibility, effici...
Generating Unique Identifiers in Python
Automating database processes are easy to do in Python. There are multiple packages allowing for interaction with different kinds of databases such as SQLalchemy. Thus we can place information from...
Negative Selection in R
R is a statistical programming language well suited for data wrangling and analysis. For many data preparation steps we need to remove data columns. In many languages we achieve this by selecting t...
A Basic NGS Data Analysis Workflow
Next Generation Sequencing (NGS) data has slowly become the standard in bioinformatics, replacing PCR and microarray expression data. Though all three hold genetic information, NGS data is unique i...
Making Aliases
Many commands in the linux terminal environment are highly customizable using flags and options. Often we use the same commands with the same settings. Aliases can be used to set the flags and opti...
Showing Significance in R Plots
R is an amazing programming language for statistical analysis. With the extension of the ggplot2 package it is relative easy to generate meaningful plots. These plots are great for visualizing and ...
Tools are just that... Tools
While working and learning I noticed a peculiarity in discussions, tutorials, and general advice. A strange focus on the the tools instead of the problem at hand is usually there. This is mostly be...
Dumping Print to Files in R using Sink
Writing data to files is an essential step in all data analysis. To save your mutated data, to make the data available for other, or to simply view the data, all of these are valid reasons. R give...
Priority Stacks: A Time Management System
There is always a limit to time, and we cannot buy easily more of it. Thus we try to efficiently manage our time. By efficiently making use of our time we can try to reduce stress and ensure we do ...