Review and resource materials for data science that I’ve found useful, gathered from around the internet, with applications in bioinformatics, computational biology, and other domains.
Table of Contents
Resources and tips on how to self-learn and learn with others
Statistics is the study of the collection, analysis, interpretation, presentation, and organization of data.
Resources generally related to learning and understanding mathematical foundations
Linear algebra is the branch of mathematics concerning vector spaces and linear mappings between such spaces.
Network science is an academic field which studies complex networks such as telecommunication networks, computer networks, biological networks, cognitive and semantic networks, and social networks, considering distinct elements or actors represented by nodes (or vertices) and the connections between the elements or actors as links (or edges).
In mathematics and computer science, an algorithm is a self-contained step-by-step set of operations to be performed.
In computer science, a data structure is a particular way of organizing and storing data in a computer so that it can be accessed and modified efficiently.
Computer programming (often shortened to programming) is a process that leads from an original formulation of a computing problem to executable computer programs.
Machine learning is the subfield of computer science that “gives computers the ability to learn without being explicitly programmed”.
Computational biology involves the development and application of data-analytical and theoretical methods, mathematical modeling, and computational simulation techniques to the study of biological, behavioral, and social systems.
Data visualization or data visualisation is viewed by many disciplines as a modern equivalent of visual communication. It involves the creation and study of the visual representation of data, meaning “information that has been abstracted in some schematic form, including attributes or variables for the units of information”.
Papers in data science, computational biology, and bioinformatics that cover the breadth of these fields.
General knowledge mapping and exploration tools
Software engineering is the systematic application of engineering to the development of software.
Reproducibility is the ability to get the same research results using the raw data and computer programs provided by the researchers.
scifigure: Visualize Reproducibility and Replicability in a Comparison of Scientific Studies (R package)
People skills are patterns of behavior and behavioral interactions. Among people, it is an umbrella term for skills under three related sets of abilities: personal effectiveness, interaction skills, and intercession skills.
Lists that are useful on their own and may intersect with the topics above.