First Data Science Challenge Write Up - Discovering Top Technology Meetups in Vancouver.

Challenge

My friend Andy Key and I introduced a new feature series, Data Science Challenges (DSC).

The purpose of DSC is to:

  1. Challenge us to use and expand our Software Engineering and Data Science skills
  2. Apply these skills to solve problems and discover new insights
  3. Build a portfolio of projects that range the full gamut of the Data Science Lifecycle

Our first challenge was to use the Meetup API to explore meetups in our city of choice.

For my challenge, I chose to explore Technology Meetup Groups in my city, beautiful Vancouver, BC.

Exploring Vancouver Tech Meetups

I wanted to explore the group’s various traits (number of members, location, compare smallest vs largest groups, group ratings, etc) and look at how often the largest groups hosted events and their attendance numbers. My intention was to discover and learn more about the groups and potentially come up with a list of recommended groups to join.

Check out my notebook for the full details.

This blog post covers:

  • Recommendations
  • Discoveries
  • How I tackled this challenge
  • Conclusion
  • How you can join Data Science Challenges

Without further delay, let’s see our recommendations.

Recommendations

So what are my recommendations for which of the largest and most active Tech Meetup groups you should join?

4th Place

If you are into Web Development and Design and don’t mind joining a group that meets once every few months, but that attracts a large number of attendees, you should join the following groups:

3rd Place

If you want to join a Tech Meetup Group that meets at least once a month and has a lot of attendees, you should join the following groups:

2nd Place

If you want to join an Entrepreneurship and Startups focused Meetup Group that meets several times a month and has a decent but sometimes inconsistent attendance, you should join the following group:

1st Place

If you are into Data Science and want to join a Meetup Group that meets several times a week and has lots of attendees, you should join the following group:

Learn Data Science MeetupLearn Data Science Meetup

Discoveries

Note: For the purposes of this analysis, the largest and smallest groups were defined below:

  • Largest groups were defined as groups with more than 10x the size of the median number of members
  • Smallest groups were defined as groups with less than 1/10th of the median number of members

With the median being 295, the largest groups had more than 3000 members while the smallest groups had less than 30 members.

Size of Groups

  • Average number of members is 680
  • Almost 80% of groups have less than 900 members
Members Histogram Members Histogram
Members Frequency Table Members Frequency Table

Largest vs. Smallest Groups

  • The largest groups tended to be centered around Downtown and the surrounding areas while the smallest groups were more spread across the city.
Location of Smallest and Largest Tech GroupsLocation of Smallest and Largest Tech Groups
  • Largest groups were not rated higher than smaller groups
  • Most groups were rated very high (>4.0)
Member vs Ratings for Largest vs Non-Largest Groups Member vs Ratings for Largest vs Non-Largest Groups
  • The smallest groups had a rating of 0
All 30 Smallest Groups had 1 Rating: 0 All 30 Smallest Groups had 1 Rating: 0

Largest Group Events

The following Meetups meet frequently (once to multiple times a month) and are either maintaining or increasing their attendance:

  • Agile Vancouver
  • Full Indie
  • HackerNest Vancouver Tech Socials
  • NetSquared Vancouver: Tech4Good and Nonprofits
  • TechVancouver
  • Learn Data Science
Yes RSVP’s over Time Plot — Data and Regression Model FitsYes RSVP’s over Time Plot — Data and Regression Model Fits

How did I go about it?

Going into this challenge, I had never used the Meetup API before, so the first step was to understand the Meetup API Module and the available methods. In order to get Tech Meetup groups in Vancouver, I had to provide the Category ID, City and Country Code. Figuring out the correct values for these and understanding how the Meetup API v2’s response payloads returned pages was the preliminary activity.

After some trial and error and a couple of helper functions later, I had all the Tech Groups based in Vancouver, BC and a few of the surrounding cities. Next, I did some initial data exploration. Looking at parts of the data, the uniqueness, data ranges, and tendencies of each column allowed me to make more discoveries, raise more questions and clean up my data.

Once the initial clean up was done, I was able to:

  • Look at the size of the groups via plots and frequency tables
  • Compare largest and smallest groups
  • Compare ratings of the largest groups against the other groups
  • Look at groups with no ratings

One of the indicators of a good Meetup group to join is the frequency and attendance of the events they host. Focusing exclusively on the 17 largest groups, I looked at the events they hosted over the past 6 months. After cleaning up the events data, I merged it with the groups' data and started analyzing the data. This analysis led to my recommended list of Meetups to join.

Conclusions

This was a very exciting and rewarding project with some interesting findings. It was great to utilize both my data (exploration, visualization, cleaning) and software engineering skills to learn more about the Meetups in my city.

There are definitely additional areas to explore, and I hadn’t answered all the questions I had raised. The fact that I focused exclusively on the largest Meetups also makes this recommendation list a bit biased. Surely there are smaller Meetups which meet frequently enough to recommend people. I will treat that and the additional questions as project extension ideas for the future.

For now, it’s on to the next Data Science Challenge!

So how do I do the Data Science Challenges thingy?

Oh, you’re still reading? Well, if you are comfortable with Programming, Statistics, Data Science and are looking to take your skills to the next level by applying your them to solve open-ended problems that interest you, I have written an excellent introduction that will help you get started.