Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
This is the repository of the ElasTest Monitoring Service, which provides a monitoring infrastructure suitable for inspecting executions of a SuT (System under Test) and the ElasTest platform itself online.
https://github.com/deib-polimi/TRex. T-Rex is a general-purpose Complex Event Processing (CEP) Infrastructure designed to support an expressive language for rule definition while offering efficient processing mechanisms. This repository provides T-Rex engine in Java.