Indonesian History Test Found Too Easy, Study Reveals Gaps in Measuring Student Ability


Image Created by AI

A 2026 study by Suwarto, Farida Nugrahani, and Singgih Subiyantoro from Universitas Veteran Bangun Nusantara finds that Indonesia’s national history test may be too easy to accurately measure students’ abilities. Published in the International Journal of Applied Research and Sustainable Sciences (IJARSS), the research highlights a mismatch between student competence and question difficulty, raising concerns about the effectiveness of current evaluation systems.

The findings matter because history education plays a key role in shaping national identity, especially among younger generations facing rapid globalization. If assessment tools fail to measure real understanding, educators and policymakers risk overestimating students’ grasp of historical knowledge and values.

Why This Research Matters

In Indonesia, history tests are often treated as routine academic requirements rather than tools to evaluate critical thinking and national awareness. As cultural boundaries blur in the digital era, ensuring that students genuinely understand the nation’s historical struggles becomes increasingly important.

Previous observations suggest that many educational assessments have not undergone rigorous quality testing. This raises doubts about whether such exams can distinguish between students who truly understand history and those who rely on memorization alone. The study by Suwarto, Farida Nugrahani, and Singgih Subiyantoro addresses this gap by focusing on the quality of the test itself, rather than the content of history education.

How the Study Was Conducted

The research used a quantitative, descriptive-exploratory design involving 200 university students from urban–rural transition areas. These participants were selected from a larger population of 1,500 first-semester students at Universitas Veteran Bangun Nusantara.

Students completed a multiple-choice test covering Indonesian national history topics, from the early nationalist movement to independence. The researchers then analyzed the responses using the Rasch Model, a modern measurement approach that evaluates both question difficulty and participant ability on the same scale.

Data analysis was carried out using the Quest software, which allowed the team to assess:

  • The reliability of the test
  • The consistency of student responses
  • Whether individual questions functioned as intended

Key Findings

The study reveals several important insights into the quality of the Indonesian history test:

  • High reliability overall
    The test shows strong internal consistency, with a person reliability score of 0.92 (very good) and item reliability of 0.87 (good).
  • Students outperform test difficulty
    The average student ability is significantly higher than the average difficulty of the questions.
  1. Mean student ability: 1.47 logit
  2. Mean item difficulty: 0.00 logit
This indicates that most questions are too easy for the participants.
  • Majority of questions need improvement
    Only 12 out of 30 questions meet the acceptable standards for measurement. The remaining 18 questions show inconsistent response patterns, meaning they may not accurately measure student ability.
  • Uneven distribution of question difficulty
    Visual analysis using a Wright Map shows a gap between question difficulty levels and student abilities. Many questions cluster around similar difficulty levels, leaving some ability ranges unmeasured.
  • Background differences influence results
    Students from urban and rural backgrounds may interpret questions differently due to variations in educational access and exposure to historical information.

What the Findings Mean

The study suggests that the current history test does not effectively capture the full range of student abilities. While the test is reliable, it lacks the complexity needed to challenge higher-performing students or identify those with lower understanding.

Suwarto from Universitas Veteran Bangun Nusantara explains that a good history assessment should go beyond memorization. It should measure deeper understanding, including the ability to analyze historical events and connect them to national values.

In an academic interpretation, the authors note that misfitting questions—those that do not align with expected response patterns—may reflect ambiguity or differences in educational background. This highlights the need for more carefully designed test items that function consistently across diverse student groups.

Implications for Education and Policy

The findings have important implications for multiple stakeholders:

For educators
Teachers need better-designed questions that assess critical thinking and historical reasoning, not just factual recall. More varied and challenging questions can improve the accuracy of student evaluation.

For policymakers
Education authorities should consider adopting modern measurement approaches like the Rasch Model to improve national assessment standards. Reliable and balanced tests are essential for fair and meaningful evaluation.

For researchers
The study opens opportunities for further investigation, including:

  • Bias analysis between urban and rural students
  • The impact of digital literacy on historical understanding
  • Development of adaptive testing systems

For students and society
Better assessment tools can lead to stronger historical awareness, which is crucial for maintaining national identity and civic responsibility in a globalized world.

Recommendations from the Study

The research team proposes several practical steps:

  • Revise or remove poorly performing questions
  • Develop a broader range of question difficulty levels
  • Create adaptive question banks based on student ability
  • Conduct deeper analysis of question bias across demographic groups

These changes aim to ensure that history tests measure student competence more accurately and fairly.

Source

Suwarto, Nugrahani, F., & Subiyantoro, S. (2026). Characteristics of the History of Indonesian National Struggle Test. International Journal of Applied Research and Sustainable Sciences (IJARSS), Vol. 4 No. 3, 275–284.
DOI: https://doi.org/10.59890/ijarss.v4i3.221

Posting Komentar

0 Komentar