This chapter outlines the purpose of the six evaluation criteria and explores their role within evaluation. It explains how the criteria can be thought of as a set of complementary lenses and the way that each criterion can provide a different perspective on the intervention and its results. The chapter also considers different types of interventions that can be evaluated by applying the criteria as the basis for judging its value or merit. The final section explores how the criteria relate to other evaluation norms and standards.
Applying Evaluation Criteria Thoughtfully

2. The six criteria: Their purpose and role within evaluation
Copy link to 2. The six criteria: Their purpose and role within evaluationAbstract
What is a criterion?
Copy link to What is a criterion?A criterion is a standard or principle used in evaluation as the basis for evaluative judgement.
Each of the six criteria is summarised by a broad question, which illustrates its overall meaning. Each one represents an important element for consideration:
Relevance: Is the intervention1 doing the right things?
Coherence: How well does the intervention fit?
Effectiveness: Is the intervention achieving its objectives?
Efficiency: How well are resources being used?
Impact: What difference does the intervention make?
Sustainability: Will the benefits last?
The evaluation criteria’s purpose is to support consistent, high-quality evaluation within a common framework. They provide a normative framework with which to assess a specific intervention. The criteria can also be used in processes beyond evaluation, including defining frameworks and indicators for monitoring and results management, funding approval, strategic planning and intervention design, particularly to improve future interventions. Collectively, there is value in having commonly defined criteria that are similarly applied across interventions. The criteria also provide a consistent language across the development field, providing standardisation and allowing for comparison and learning across interventions.
The criteria should be viewed as a set of lenses through which one can understand and analyse an intervention. The criteria provide complementary perspectives, giving a holistic picture of the intervention. They encourage deeper thinking about the nature of an intervention, its implementation, process and results. Together they describe the desired attributes of interventions, make explicit assumptions and provide norms: that interventions should be relevant to the context, coherent with other interventions, achieve results in an efficient way and have positive, lasting impacts for sustainable development. The evaluation criteria also provide a widely accepted framework for developing an approach to evaluation, a comprehensive and systematic approach and a common language that is used from the very start of the evaluation process.
The criteria are related and can be seen as a complementary set, to which each criterion brings a different and unique perspective to the understanding of the intervention. The definition of each criterion presents a distinct concept (captured in the overarching question for each) and yet these concepts are in many ways interrelated. Development interventions are typically multifaceted so using the criteria relationally can help the evaluator to consider the intervention as a whole.
The evaluation criteria are not a methodology and they are not the goals that an intervention is trying to achieve. Instead they provide prompts for asking the right questions during the evaluation of an intervention. Every intervention (the object or evaluand2) is different, which is why the evaluation process needs to be flexible and the use of evaluation criteria should be thoughtful and adapted to the purpose and users. In accordance with evaluation quality standards, conclusions and recommendations about progress and performance should be based on appropriate, credible evidence. The logic, credibility and interpretation of evidence should be clear and follow the theory of change or logic of the intervention.
What types of interventions can be evaluated with these criteria?
Copy link to What types of interventions can be evaluated with these criteria?The term “intervention” is used throughout the guidance to mean the topic or object of the evaluation. It encompasses all the different types of efforts that may be evaluated using these criteria. These can be either international or domestic, aiming to support sustainable development or humanitarian goals.
An intervention may denote a project, programme, policy, strategy, thematic area, technical assistance, policy advice, an institution, financing mechanism, instrument, or other activity. It includes development interventions, humanitarian assistance, peacebuilding support, normative work and other international co-operation activities, as well as the activities of private sector actors and national governments in domestic policy contexts. The criteria have been used, for example, to evaluate topics ranging from the efficiency of national school feeding programmes, to the coherence of international support provided by different actors in a conflict-affected region. They are also used to evaluate policies or strategies (e.g. see Box 2.1). Evaluations also use the criteria when evaluating a suite of interventions – for example an evaluation of various projects supporting the education sector or a single large “intervention” lasting many years and involving multiple partners (e.g. general budget support).
The term “intervention” is used in this document as it is the most easily understood, but evaluators should take care to define clearly the topic of the evaluation – the intervention being analysed – and the evaluation scope, early on in the process. Using other more specific words related to the intervention in question – such as project or policy – is likely to be more useful and more readily understood by partners.
Box 2.1. Adapting the criteria to the intervention: Evaluation of the BMZ’s “Action Plan for the Inclusion of Persons with Disabilities”
Copy link to Box 2.1. Adapting the criteria to the intervention: Evaluation of the BMZ’s “Action Plan for the Inclusion of Persons with Disabilities”The evaluation of the German Federal Ministry for Economic Co-operation and Development’s (BMZ) Action Plan for Inclusion of Persons with Disabilities provides an interesting example of how the criteria may be adjusted to reflect the intervention being evaluated (in this case a strategic action plan) and how the criteria connect to the evaluation questions. The evaluation was designed to examine how successful the Action Plan had been in advancing the systematic mainstreaming of inclusion within German development co-operation. The evaluation was based on the following general evaluation questions, which were drawn from the Action Plan itself. Each of these general questions was broken down into more detailed questions by criteria:
1. To what extent does the BMZ set a good example in its own organisation with regard to the inclusion of persons with disabilities?
Relevance: To what extent do the selected fields of action and measures correspond to the provisions of the United Nations Convention on the Rights of Persons with Disabilities (CRPD)?
Effectiveness: To what extent is BMZ succeeding in establishing inclusive structures and practices?
2. To what extent does the Action Plan help boost the inclusion of persons with disabilities in the partner countries of German development co-operation?
Relevance: To what extent do the selected fields of action and measures correspond to the provisions of the CRPD?
Effectiveness: To what extent were persons with disabilities included more effectively in measures in the partner countries of German development co-operation?
3. To what extent does the BMZ act at the national, regional and international levels as an advocate and partner for the rights of persons with disabilities in development co-operation?
Relevance: To what extent do the selected fields of action and measures correspond to the provisions of the United Nations CRPD?
Effectiveness: To what extent has the BMZ succeeded in winning over bilateral and multilateral actors to the cause of inclusion?
4. How was the development and implementation of the Action Plan for Inclusion managed?
Effectiveness: Which management mechanisms and structures (including the roles of the various stakeholder groups in the management process) determined the development and implementation of the Action Plan?
Efficiency: To what extent were the measures implemented as planned?
5. How should we rate the benefits of the Action Plan for Inclusion in terms of its breadth of impact and leverage as a governance instrument?
Impact: To what extent did the Action Plan generate broad impact?
Impact: To what overall extent did the Action Plan create leverage?
Source: Schwedersky, Ahrens and Steckhan (2017[1]), Evaluation of the BMZ Action Plan for the Inclusion of Persons with Disabilities, https://d-nb.info/1186644206/34
Where do the criteria fit in to the broader evaluation field?
Copy link to Where do the criteria fit in to the broader evaluation field?Because every intervention (the entity being evaluated) is different, the evaluation process needs to be flexible and the use of evaluation criteria should be thoughtful and adapted to the purpose and users. The purpose of the criteria is not therefore to provide a mandatory set of rules.
The criteria do not exist in isolation. To make best use of the criteria it is important to understand where they fit in relation to other norms and standards, methodologies and institution-level guidance.
The macro level is where the core principles for evaluation are situated, such as impartiality and independence, credibility, usefulness and participation (OECD, 1991[2]) along with the Quality Standards for Development Evaluation (OECD, 2010[3]). This level also includes the ethical standards related to research and data collection. These provide the overarching standards for good practice that all evaluations should adhere to throughout the process.
The evaluation criteria are found at a meso level, providing the lenses through which the intervention is analysed and understood.
The institutional level is where each organisation adapts the criteria and translates them into their own guidelines and policies (e.g. emphasising certain criteria, mandating ratings, adding additional criteria), reflecting their own mandates and priorities.
The micro level, i.e. within the context of each individual evaluation, includes the decisions about the evaluation questions and methodologies that are employed for each evaluation, and how the criteria are applied in that context and to that specific evaluand.
Figure 2.1. How the criteria fit in with other norms and standards
Copy link to Figure 2.1. How the criteria fit in with other norms and standards
References
[3] OECD (2010), Quality Standards for Development Evaluation, DAC Guidelines and Reference Series, OECD Publishing, Paris, https://dx.doi.org/10.1787/9789264083905-en.
[2] OECD (1991), Principles for Evaluation of Development Assistance, Development Assistance Committee, OECD Publishing, Paris, https://www.oecd.org/development/evaluation/2755284.pdf (accessed on 11 January 2021).
[1] Schwedersky, T., L. Ahrens and H. Steckhan (2017), Evaluation of the BMZ Action Plan for the Inclusion of Persons with Disabilities, DEval, https://d-nb.info/1186644206/34 (accessed on 11 January 2021).
Notes
Copy link to Notes← 1. The term “intervention” is used here and throughout the document to mean the object of the evaluation (the thing that is being evaluated). See chapter 3.2, Adapting the criteria to the evaluation’s purpose, for further discussion on this point.
← 2. The term “evaluand” refers to the object of the evaluation – the thing that is being evaluated.