Data science is an interdisciplinary field that involves the extraction, analysis, and interpretation of large and complex data sets using a combination of statistical, computational, and machine learning techniques. Data science can involve working with structured data, such as data in tables or spreadsheets, or unstructured data, such as text, images, or video.
Data science encompasses a variety of methods and tools for working with data, including data mining, data visualization, statistical modeling, and machine learning. These methods are used to extract insights and knowledge from data, which can then be used to inform decision-making in a range of fields, from business and finance to healthcare and social sciences.
Data scientists typically have strong skills in mathematics, statistics, and programming, as well as domain expertise in the field they are working in. They work with large data sets to clean, preprocess, and transform the data, and then use statistical methods to analyze and interpret the data. Machine learning techniques, such as classification, regression, and clustering, can be used to build predictive models or identify patterns in the data.
![]() |
Atlan Data WikiExplore an online encyclopedia that invites users to learn about different elements of data and contribute new entries. | FreeBrowser | ![]() |
Data Science, Glossary, Learning |
![]() |
GlosarioExplore and learn more about common data science vocabulary using this glossary (also available in multiple other languages) | FreeBrowser | ![]() |
Data Science, Glossary, Learning |
![]() |
ConstellateAnalyze text and data in a platform that also serves as a tool for classroom teaching of analytics skills. | FreemiumBrowser | ![]() |
Data Science, Learning, Text Analysis |
![]() |
ParaViewVisualize and analyze large datasets. | Open Source, FreeLinux, Mac, Windows, Server | ![]() |
Data Science |
![]() |
OrangeExplore this data science toolkit for data mining, data visualization and machine learning using a visual programming language. Advertised as useful for teachers and learners in Data Science. | Open SourceLinux, Mac, Windows | ![]() |
Data Science, Learning |
![]() |
Posit (formerly R Studio) Use RStudio's integrated development environment to execute code written in R. |
One-time cost, Open SourceBrowser, Linux, Mac, Server, Windows | ![]() |
Data Science, Programming |
![]() |
Posit Cloud (formerly R Studio Cloud) Use a cloud based version of RStudio for teaching and learning data science and analyzing and sharing data from the RStudio integrated development environment. |
Free, SubscriptionBrowser | ![]() |
Data Science, Programming |
![]() |
KaggleFind datasets or publish them in a cloud-based platform which enables users to share code, datasets and analyze data in Python, R and R Markdown, and provides a platform for AI education and machine learning competitions. | FreeBrowser | ![]() |
Data Science, Machine Learning, Programming, Repository |
![]() |
KNIMEAnalyze and visualize data. | Open Source, SubscriptionBrowser, Linux, Mac, Server, Windows | ![]() |
Data Science |