Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
During a recent Dask tutorial someone asked "Can Datashader run on Dask?" and I was pleased.
It would be good to have an easy-to-run example that was advertised Datashader from Dask resources. Often we do this in examples.dask.org . Is there an example that makes sense to deploy there?
Malastare.ai is a startup Analytics Consulting Firm. Based in Texas, USA. We combine deep industry knowledge with specialized expertise in analytics, strategy, operations, and risk management. We leverage our clients' real-world experience, industry best practices and technology best practices to enable them to succeed in their big data projects.
After reading many article that used data from Apple's health app. I wanted to try to create some data visualizations of test set count dataset. I used python, matplotlib and seaborn to analyze the step counts from the test data. I have included the work in a jupyter notebook file and the csv file that has the test data used for this side project.
Over the past months, we have seen a significant racial justice reckoning happening across the country since the killing of George Floyd by a police officer in May 2020. This incident sparked a redirection of attention to similar lives that had been lost at the hands of officers, leading to calls for re-evaluation of the role and power that police hold. In order for stakeholders like activism groups and local policymakers to make the most change in the quickest and most effective manner in response to these calls, the data code and report strived to answer a questions that will enable this. The primary tool used was R, with ggplot and machine learning packages.
This is a Repository made for Coursera Assignments, and Tutorials which includes many interesting plots such as waffle charts, folium charts, chloropeth charts etc.
An analysis of my Facebook social network using a Gephi network graph visualization to determine the number and types of communities I am a part of online.
I analyze and explore US Census Bureau Data using Data Visualization techniques to identify salient features useful for predicting an individual's income level. We use those relevant features and multiple classification methods (Decision-Tree, SVM, and K-Nearest Neighbor) to predict the income level for unknown individuals. Our client is a local University who wants to use income as the key demographic to decide criteria for marketing its degree programs. Each classifier explored has an accuracy of over 85%.
During a recent Dask tutorial someone asked "Can Datashader run on Dask?" and I was pleased.
It would be good to have an easy-to-run example that was advertised Datashader from Dask resources. Often we do this in examples.dask.org . Is there an example that makes sense to deploy there?