2  Data

2.1 Description

The data we use is provided from National Basketball Association (NBA). All the data is from NBA official, and the data is publicly available and can be accessed at the following URL: https://www.nba.com/stats. From the website, we mainly use the basic stats of every NBA teams and every NBA players from season 2023-24.

The data of NBA teams have attributes like the number of wins and losses games in this season, average scores per game, average minutes played per game, and son on. It has 26 attributes and there are total 30 teams. The data of NBA players have about 500 players and 26 attributes like number of wins and losses games in this season, average scores per game, and so on. The attributes of two data are the same.

The data is update daily in the current seasons and the data of previous season will not be updated. Since the data is publicly available, we directly copy the stats from the website. The precise links of two data are https://www.nba.com/stats/players/traditional?Season=2023-24 and https://www.nba.com/stats/teams/traditional?Season=2023-24.

2.2 Missing value analysis

There is no missing value in both data as the following two graphs show.

Code
library(naniar)
players_data_2023_2024 <- read.csv("data/NBA_players_2023-2024csv.csv")
vis_miss(players_data_2023_2024)

It shows there is no missing value in the data of NBA players.

Code
library(naniar)
team_data_2023_2024 <- read.csv("data/NBA_team_2023-2024csv.csv")
vis_miss(team_data_2023_2024)

It shows there is no missing value in the data of NBA teams.