Back to Projects

Team Name:

LNTP


Team Members:


Evidence of Work

Clustering NSW Senior Highschools by Socio-economics Factors

Project Info

Team Name


LNTP


Team Members


2 members with unpublished profiles.

Project Description


Team Members and Team Captain

  • Team Member: Thang Phung
  • Team Captain: Xuan Cuong Nguyen

Project Description

This project aims to uncover patterns of educational inequity among schools by analyzing socio-economic and demographic data. Using key indicators such as ICSEA (socio-educational advantage), FOEI (socio-economic disadvantage), LBOTE (language background other than English), and Indigenous representation, the project groups schools into clusters to reveal disparities in resource allocation and educational outcomes. The results provide insights into which schools are most disadvantaged and where targeted interventions may be needed to improve educational equity.


Introduction:

Education is a powerful driver of social mobility, yet not all students have equal access to high-quality educational resources. Using a dataset that includes socio-economic indicators such as ICSEA (socio-educational advantage), FOEI (socio-economic disadvantage), LBOTE (language background other than English), and Indigenous representation, this project aims to uncover patterns of inequity among schools. By applying K-Means clustering, we segmented schools into distinct groups based on their socio-economic profiles, allowing for a deeper understanding of educational disparities and where interventions are most needed.


The Findings:

Our analysis revealed several important insights:

  • Clear Disparities: Schools with lower ICSEA values (less socio-educational advantage) also had higher FOEI values (more socio-economic disadvantage). These schools were predominantly in clusters with higher percentages of Indigenous and LBOTE students, highlighting a pattern of educational inequality.

  • Inverse Trends: We found a strong inverse relationship between ICSEA and FOEI values—schools with more socio-educational advantage faced less socio-economic disadvantage.

  • Targeted Interventions Needed: Schools in the most disadvantaged clusters, with high FOEI values and low ICSEA values, would benefit the most from targeted interventions, such as additional funding, language support, and culturally tailored programs for Indigenous students.

FOEI over ICSEA

Scatterplot


Conclusion:

This project tells a compelling data story about educational inequities. By clustering schools based on their socio-economic and demographic profiles, we have provided a clear, data-driven case for addressing educational disparities and guiding policy interventions that can make a meaningful impact.


#opendata #nsw #digital #machinelearning #ml #classification #clustering #equity #data #education #socio-economics

Data Story


Data Story: Identifying Educational Inequities Through Socio-Economic Clustering

The Journey of the Data:

The project started with school-level data, capturing key socio-economic and demographic indicators. We replaced missing values and standardized the data to ensure consistency across all schools. Our key metrics—ICSEA, FOEI, LBOTE_pct, and Indigenous_pct—were chosen because they strongly correlate with student outcomes and educational equity. We used a data sampling methods and other preprocessing techniques to ensure the data is fit for machine learning purposes.

We then applied K-Means clustering to group schools into clusters based on their socio-economic and demographic profiles. This allowed us to create distinct groups that showed how certain schools, often serving vulnerable communities, are socio-economically disadvantaged.

Additionally, we created a web application to display the schools and their classification visually on a map. We believe this is the most intuitive way of viewing the data, and would be a great help in understanding the numbers analyzed by our Machine Learning model.


Impact and Next Steps:

This analysis highlights the urgent need for equitable resource allocation. The clusters we identified show that certain groups of schools face systemic disadvantages that affect their ability to provide quality education. The data story is clear: educational inequity is tightly linked to socio-economic factors, and understanding these relationships can guide better policy decisions.

Going forward, this model can be used to:
- Predict future school outcomes based on socio-economic trends.
- Direct funding and resources more effectively to the schools and regions that need it most.
- Provide insights for long-term planning and policy reforms to ensure all students have equal opportunities to succeed.


Evidence of Work

Video

Homepage

Project Image

Team DataSets

NSW public schools master dataset

Description of Use These metrics help us understand the socio-economic factors that influence student demographics and educational equity and create meaningful groupings of schools that share similar socio-economic profiles

Data Set

Challenge Entries

Innovating Wellbeing Measurement – Uncovering New Connections

We want to explore different ways to link data and outcomes to domains and indicators under the <a href="https://www.act.gov.au/wellbeing/wellbeing-framework/domains-and-indicators"> ACT Government’s Wellbeing Framework.</a> Exploring external data sources can provide broader insights into the factors influencing our wellbeing, and improve its measurement.

Go to Challenge | 17 teams have entered this challenge.

Factors that influence education, skills and training choices of young people

What factors impact the decisions of young people to commence and complete post school studies (Vocational Education and Training or higher education), including those that commence and complete an apprenticeship?

Go to Challenge | 17 teams have entered this challenge.

Connecting Communities: Your Guide to local Information and Services

How can we assist local residents and visitors in easily locating local services and providing answers to frequently asked questions?

Go to Challenge | 29 teams have entered this challenge.

Place-based insights unlocked: Generative AI x Digital Atlas of Australia

How can we use the Digital Atlas of Australia’s API and Generative AI to create innovative, user-friendly tools and visualisations that make geospatial data accessible to everyone, empower decision-making, and help all Australians better engage with their local and national environments?

Go to Challenge | 15 teams have entered this challenge.