Fetch newer groups & their events

The current scrape implementation **only fetches events from the 100 most active tech groups**, because of the default sort order and pagination imposed by the Meetup website. @danielepolencic therefore suggested fetching newer groups and adding them to the DB first, then fetching events based on the groups in the DB.

This issue should be addressed with the following solution:
<details>
<summary>Step by Step description</summary>

> **Current Implementation:**
> 
> 1. Fetch the 100 most active groups and their RSS URLs.
> 2. Parse the RSS URLs to get the relevant event URLs.
> 3. Fetch event details from the event URLs.
> 4. If an event doesn't already exist in the events table, add it. Otherwise, update the state of the existing event.
> 
> 
> **Proposed New Implementation:**
> 
> _Getting groups_
> 1. Get the 100 newest groups.
> 2. If a group doesn't already exist in the groups table, add it. Otherwise, update the state of the existing group.
> 3. If any of the fetched groups already exist, stop this task.
> 
> _Getting events_
> 1. For each group in the groups table, get its RSS URL and parse it to get the relevant event URLs.
> 2. Fetch event details from the event URLs.
> 3. If an event doesn't already exist in the events table, add it. Otherwise, update the state of the existing event.
> 

</details>
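
To make the proposed flow concrete, here is a minimal sketch in Python. It is not the harvester's actual code: the dict-backed tables, the `url` and `rss_url` keys, and the `fetch_event_urls` / `fetch_event` callables are all hypothetical stand-ins for the real DB layer and scraper.

```python
from typing import Callable, Dict, Iterable, List

Group = Dict[str, str]  # hypothetical shape: {"url": ..., "rss_url": ...}
Event = Dict[str, str]  # hypothetical shape: {"url": ..., "state": ...}


def harvest_groups(newest_groups: Iterable[Group],
                   groups_table: Dict[str, Group]) -> int:
    """Walk groups newest-first and stop at the first one already stored.

    Returns the number of groups that were newly added.
    """
    added = 0
    for group in newest_groups:
        if group["url"] in groups_table:
            # Known group: refresh its record, then stop the task, on the
            # assumption that everything older was stored by an earlier run.
            groups_table[group["url"]] = group
            break
        groups_table[group["url"]] = group
        added += 1
    return added


def harvest_events(groups_table: Dict[str, Group],
                   events_table: Dict[str, Event],
                   fetch_event_urls: Callable[[str], List[str]],
                   fetch_event: Callable[[str], Event]) -> None:
    """Fetch events for every stored group, inserting or updating each one."""
    for group in list(groups_table.values()):
        for event_url in fetch_event_urls(group["rss_url"]):
            # Keying by URL means a repeat fetch overwrites the stored state
            # instead of creating a second row.
            events_table[event_url] = fetch_event(event_url)
```

Stopping at the first known group is what keeps the group scrape cheap on repeat runs, but it relies on the listing really being sorted newest-first.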

High level overview of tasks:
- [x] Configure the harvester service to continuously scrape for groups & add them to the DB until it finds a group that already exists in the DB
- [x] Configure the harvester service to parse RSS from groups existing in the DB
- [ ] Check for duplication of groups & events
- [x] Add integration tests
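
For the open duplication check, one possible approach is to let the database enforce uniqueness. The sketch below uses Python's built-in `sqlite3` purely for illustration. The project's real database, the table layout, and the choice of the event URL as the unique key are all assumptions.

```python
import sqlite3


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Create the (hypothetical) tables with the URL as primary key."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS groups (url TEXT PRIMARY KEY, name TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        "url TEXT PRIMARY KEY, group_url TEXT, state TEXT)"
    )
    return conn


def save_event(conn: sqlite3.Connection, url: str,
               group_url: str, state: str) -> bool:
    """Insert the event, or update its state if the URL is already stored.

    Returns True when a new row was created.
    """
    cur = conn.execute(
        "INSERT OR IGNORE INTO events (url, group_url, state) VALUES (?, ?, ?)",
        (url, group_url, state),
    )
    is_new = cur.rowcount == 1
    if not is_new:
        # The primary key rejected the insert, so only refresh the state.
        conn.execute("UPDATE events SET state = ? WHERE url = ?", (state, url))
    conn.commit()
    return is_new
```

With a primary key in place, a repeated scrape of the same event can only update the existing row, so an integration test can assert that the row count stays constant across two runs.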




Fetch newer groups & their events #77
