Package 'SGPdata' reference manual

Title:	Exemplar Data Sets for Student Growth Percentiles (SGP) Analyses
Description:	Data sets utilized by the 'SGP' package as exemplars for users to conduct their own student growth percentiles (SGP) analyses.
Authors:	Damian W. Betebenner [aut, cre], Adam R. Van Iwaarden [aut], Ben Domingue [aut]
Maintainer:	Damian W. Betebenner <[email protected]>
License:	GPL-3
Version:	28.5-0.0
Built:	2025-03-09 02:41:22 UTC
Source:	https://github.com/centerforassessment/sgpdata

Exemplar Data Sets for Student Growth Percentile (SGP) Analyses

Description

The 'SGPdata' package contains five exemplar data sets utilized by the 'SGP' package for testing and examples.

Details

Package:	SGPdata
Type:	Package
Version:	28.5-0.0
Date:	2024-12-3
License:	GPL-3
LazyLoad:	Yes
LazyData:	Yes

Longitudinal Student Assessment Data

Description

A dataset comprising a subset of five years of state assessment reading data suitable for student growth percentile and percentile growth projection/trajectory analyses. The dataset is used in examples provided in the documentation for the studentGrowthPercentiles and
studentGrowthProjections functions, and serves as an exemplar for user construction of their own datasets for growth percentile analyses.

Format

A data frame of student level observations for five years on the following variables.

ID: Student ID
GRADE_2020: Student Grade Tested 2020, possibly missing
GRADE_2021: Student Grade Tested 2021, possibly missing
GRADE_2022: Student Grade Tested 2022, possibly missing
GRADE_2023: Student Grade Tested 2023, possibly missing
GRADE_2024: Student Grade Tested 2024, possibly missing
SS_2020: Student Scale Score 2020, possibly missing
SS_2021: Student Scale Score 2021, possibly missing
SS_2022: Student Scale Score 2022, possibly missing
SS_2023: Student Scale Score 2023, possibly missing
SS_2024: Student Scale Score 2024, possibly missing

Note

All datasets used with studentGrowthPercentiles and studentGrowthProjections must be specifically formatted as wide format files. The first variable/column is the student ID variable. The next set of columns provide the grade of the student across all the years provided in the dataset (possibly missing). The last set of columns provide the scales scores of the student in the respective grades. Multi-year operational analyses benefit from putting data in long format. See sgpData_LONG and associated documentation for a comprehensive account.

Source

Anonymized student level state assessment data

Lookup table associating INSTRUCTOR_NUMBER with student ID by YEAR and CONTENT_AREA

Description

A lookup table comprising student IDs associated with INSTRUCTOR_NUMBERa subset of five years of state assessment reading and mathematics data suitable for student growth percentile and percentile growth projection/trajectory analyses. The dataset is in LONG format with each record representing a unique teacher by student by year by content area combination.

Format

A data frame of student level observations for five years and two content areas for the following variables.

ID: Unique Student Identification Number
CONTENT_AREA: Content area for student/teacher record (Reading or Mathematics)
YEAR: Year for student/teacher record
INSTRUCTOR_NUMBER: Unique instructor number identifier
INSTRUCTOR_LAST_NAME: Instructor last name
INSTRUCTOR_FIRST_NAME: Instructor first name
INSTRUCTOR_WEIGHT: Proportional weight associated with student/teacher record
INSTRUCTOR_ENROLLMENT_STATUS: Indicator of full enrollment status of student with teacher

Source

Anonymized student/teacher lookup table in LONG format

Longitudinal Student Assessment Data in LONG format

Description

A dataset comprising a subset of five years of state assessment reading and mathematics data suitable for student growth percentile (SGP) and percentile growth projection/trajectory analyses. The dataset is in LONG format with each record representing a unique student, by year by content area combination. The example data is used in examples and illustration in the enclosed documentation to show how SGPs can be operationalized across years with the results being used to produce a variety of high quality visualizations both at the aggregate (e.g., school) and individual level.

Format

A data frame of student level observations for five years and two content areas for the following variables.

ID: Unique Student Identification Number
LAST_NAME: Student last name
FIRST_NAME: Student first name
CONTENT_AREA: Content area for student observation (Reading or Mathematics)
YEAR: Year for student observation
GRADE: Grade level of student observation
SCALE_SCORE: Student Scale Score
ACHIEVEMENT_LEVEL: Achievement level associated with student scale score
GENDER: Student gender
ETHNICITY: Student ethnicity
FREE_REDUCED_LUNCH_STATUS: Student free/reduced lunch status
ELL_STATUS: Student English Language Learner status
IEP_STATUS: Student Individual Education Plan status
GIFTED_AND_TALENTED_PROGRAM_STATUS: Student Gifted and Talented Program status
SCHOOL_NUMBER: School number associated with student record
SCHOOL_NAME: School name associated with school number and student record
EMH_LEVEL: Elementary, Middle, High school indicator for school attended
DISTRICT_NUMBER: District number associated with student record
DISTRICT_NAME: District name associated with school number and student record
SCHOOL_ENROLLMENT_STATUS: Indicator of full academic year student enrollment in school
DISTRICT_ENROLLMENT_STATUS: Indicator of full academic year student enrollment in school
STATE_ENROLLMENT_STATUS: Indicator of full academic year student enrollment in school
VALID_CASE: Indicator of whether the case is valid or invalid

Note

All datasets used with studentGrowthPercentiles and studentGrowthProjections must be specifically formatted as wide format files. The first variable/column is the student ID variable. The next set of columns provide the grade of the student across all the years provided in the dataset (possibly missing). The last set of columns provide the scales scores of the student in the respective grades. Operational users of this package are advised to use data formatted similar to the sgpData_LONG with the higher level functions prepareSGP, analyzeSGP, combineSGP, summarizeSGP, visualizeSGP, and outputSGP

Source

Anonymized student level state assessment data in LONG format

Longitudinal Student Assessment Data in LONG format with COVID-19 2020 interruption

Description

A dataset comprising a subset of seven years of state assessment ELA and mathematics data across an 8 year span suitable for student growth percentile (SGP) and percentile growth projection/trajectory analyses. The dataset is missing 2020 data to help users model COVID related interuptions to student growth. The dataset is in LONG format with each record representing a unique student, by year by content area combination. The example data is used in examples and illustration in the enclosed documentation to show how SGPs can be operationalized across years with the results being used to produce a variety of high quality visualizations both at the aggregate (e.g., school) and individual level.

Format

A data frame of student level observations for seven years (across an 8 year span) and two content areas for the following variables.

VALID_CASE: Indicator of whether the case is valid or invalid
CONTENT_AREA: Content area for student observation (ELA or Mathematics)
YEAR: Year for student observation
ID: Unique Student Identification Number
IEP_STATUS: Student Individual Education Plan status
GRADE: Grade level of student observation
SCALE_SCORE: Student Scale Score
SCALE_SCORE_without_COVID_IMPACT: Student Scale Score without COVID impact. Original scale score
ACHIEVEMENT_LEVEL: Achievement level associated with student scale score
ETHNICITY: Student ethnicity
FREE_REDUCED_LUNCH_STATUS: Student free/reduced lunch status
ELL_STATUS: Student English Language Learner status
GENDER: Student gender
SCHOOL_NUMBER: School number associated with student record
SCHOOL_NAME: School name associated with school number and student record
DISTRICT_NUMBER: District number associated with student record
DISTRICT_NAME: District name associated with school number and student record

Note

All datasets used with studentGrowthPercentiles and studentGrowthProjections must be specifically formatted as wide format files. The first variable/column is the student ID variable. The next set of columns provide the grade of the student across all the years provided in the dataset (possibly missing). The last set of columns provide the scales scores of the student in the respective grades. Operational users of this package are advised to use data formatted similar to the sgpData_LONG_COVID with the higher level functions prepareSGP, analyzeSGP, combineSGP, summarizeSGP, visualizeSGP, and outputSGP

Source

Anonymized and altered student level state assessment data in LONG format

Longitudinal Student Assessment Data in LONG format with DATE for SGPt analyses.

Description

A dataset comprising a subset of 3 years (and time periods) of interim assessment early literacy, reading and mathematics data suitable for time dependent student growth percentile (SGPt) and percentile growth projection/trajectory analyses. The dataset is in LONG format with each record representing a unique student, by year/time period by content area combination. The example data is used in examples and illustrations in the enclosed documentation to show how SGPts can be operationalized across years with the results being used to produce a variety of high quality visualizations both at the aggregate (e.g., school) and individual level.

Format

A data frame of student level observations for five years and two content areas for the following variables.

COUNTRY: Variable indicating the COUNTRY associated with test record.
STATE: State associated with the test record.
VALID_CASE: VALID_CASE variable.
CONTENT_AREA: Variable indicating content area associated with test record.
YEAR: Variable indicating the year associated with the test record.
DATE: Variable indicating the year associated with the test record: Must be in YYYY-MM-DD format.
ID: Unique student identifier associated with test record.
GRADE: Variable indicating the grade associated with the test record.
SCALE_SCORE: Variable providing the scale score associated with the test record.
SCALE_SCORE_RASCH: Variable providing the Rasch score associated with the test record.
SEM: Variable indicating the standard error of measurement associated with the scale score.
ACHIEVEMENT_LEVEL: Variable indicating the achievement level associated with the scale score.

Source

Anonymized student level interim assessment data in LONG format

Package 'SGPdata'

Help Index

Exemplar Data Sets for Student Growth Percentile (SGP) Analyses

Description

Details

Longitudinal Student Assessment Data

Description

Format

Note

Source

See Also

Lookup table associating INSTRUCTOR_NUMBER with student ID by YEAR and CONTENT_AREA

Description

Format

Source

See Also

Longitudinal Student Assessment Data in LONG format

Description

Format

Note

Source

See Also

Longitudinal Student Assessment Data in LONG format with COVID-19 2020 interruption

Description

Format

Note

Source

See Also

Longitudinal Student Assessment Data in LONG format with DATE for SGPt analyses.

Description

Format

Source

See Also