Skip to content

sportsdataverse/fastRhockey-pwhl-raw

Repository files navigation

fastRhockey-pwhl-raw

Raw PWHL game JSON data scraped from the HockeyTech API via fastRhockey.

  graph LR;
    A[fastRhockey-pwhl-raw]-->B[fastRhockey-pwhl-data];
    B[fastRhockey-pwhl-data]-->C1[pwhl_pbp];
    B[fastRhockey-pwhl-data]-->C2[pwhl_player_boxscores];
    B[fastRhockey-pwhl-data]-->C3[pwhl_rosters];
    B[fastRhockey-pwhl-data]-->C4[pwhl_schedules];

Loading

fastRhockey PWHL workflow diagram

flowchart TB;
    subgraph A[fastRhockey-pwhl-raw];
        direction TB;
        A1[scripts/daily_pwhl_scraper.sh]-->A2[R/scrape_pwhl_raw.R];
    end;

    subgraph B[fastRhockey-pwhl-data];
        direction TB;
        B1[scripts/daily_pwhl_R_processor.sh]-->B2[R/pwhl_data_creation.R];
    end;

    subgraph C[sportsdataverse Releases];
        direction TB;
        C1[pwhl_pbp];
        C2[pwhl_player_boxscores];
        C3[pwhl_rosters];
        C4[pwhl_schedules];
    end;

    A-->B;
    B-->C1;
    B-->C2;
    B-->C3;
    B-->C4;

    click C1 "https://github.com/sportsdataverse/sportsdataverse-data/releases/tag/pwhl_pbp" _blank;
    click C2 "https://github.com/sportsdataverse/sportsdataverse-data/releases/tag/pwhl_player_boxscores" _blank;
    click C3 "https://github.com/sportsdataverse/sportsdataverse-data/releases/tag/pwhl_rosters" _blank;
    click C4 "https://github.com/sportsdataverse/sportsdataverse-data/releases/tag/pwhl_schedules" _blank;

Loading

sportsdataverse-data releases

Release tag Content
pwhl_pbp PWHL play-by-play data
pwhl_player_boxscores PWHL player box scores (skaters + goalies)
pwhl_rosters PWHL rosters
pwhl_schedules PWHL schedules

Structure

pwhl/
├── json/
│   ├── raw/              # Raw HockeyTech API responses per game
│   └── final/            # Processed via fastRhockey pipeline (PBP, box scores, game info)
├── schedules/
│   ├── rds/              # Season schedules (pwhl_schedule_{year}.rds)
│   └── parquet/          # Season schedules in parquet format
├── pwhl_schedule_master.rds      # Combined schedule across all seasons
└── pwhl_schedule_master.parquet

Data Sources

  • HockeyTech statviewfeed — play-by-play, game summary, schedule
  • HockeyTech gc feed — game center summary (scoring, penalties, shots, three stars)

Automation

  • Scraping workflow runs daily during the PWHL season (Nov-May)
  • On push, triggers the fastRhockey-pwhl-data repo to compile datasets

Related repositories

fastRhockey-pwhl-raw data repository (source: HockeyTech API)

fastRhockey-pwhl-data repository (source: HockeyTech API)

fastRhockey-nhl-raw data repository (source: NHL API)

fastRhockey-nhl-data repository (source: NHL API)

fastRhockey-data legacy repository (archived; sources: NHL Stats API + PHF)

Part of the SportsDataverse

About

fastRhockey PWHL raw JSON data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors