Unpacking the logic behind the magic in the use of AI for anticorruption screening (re Pastor Sanz, 2022)

‘Network of public contracts, contracting bodies, and awarded companies in Spain’ in 2020 and 2021; Pastor Sanz (2022: 7).

[Note: please don’t be put off by talk of complex algorithms. The main point is precisely that we need to look past them in this area of digital governance!].

I have read a new working paper on the use of ‘blackbox algorithms’ as anti-corruption screens for public procurement: I Pastor Sanz, ‘A New Approach to Detecting Irregular Behavior in the Network Structure of Public Contracts’. The paper aims to detect corrupt practices by exploiting network relationships among participants in public contracts. The paper implements complex algorithms to support graphical analysis to cluster public contracts with the aim of identifying those at risk of corruption. The approach in the paper would create ‘risk maps’ to eg prioritise the investigation of suspected corrupt awards. Such an approach could be seen to provide a magical* solution to the very complex issue of corruption monitoring in procurement (or more generally). In this post, I unpack what is behind that magic and critically assess whether it follows a sound logic on the workings of corruption (which it really doesn’t).

The paper is technically very complex and I have to admit to not entirely understanding the specific workings of the graphical analysis algorithms. I think most people with an interest in anti-corruption in procurement would also struggle to understand it and, even data scientists (and even the author of the paper) would be unable to fully understand the reasons why any given contract award is flagged as potentially corrupt by the model, or to provide an adequate explanation. In itself, this lack of explainability would be a major obstacle to the deployment of the solution ‘in the real world’ [for discussion, see A Sanchez-Graells, ‘Procurement Corruption and Artificial Intelligence: Between the Potential of Enabling Data Architectures and the Constraints of Due Process Requirements’]. However, perhaps more interestingly, the difficulty in understanding the model creates a significant additional governance risk in itself: intellectual debt.

Intellectual debt as a fast-growing governance risk

Indeed, this type of very complex algorithmic approach creates a significant risk of intellectual debt. As clearly put by Zittrain,

‘Machine learning at its best gives us answers as succinct and impenetrable as those of a Magic 8-Ball – except they appear to be consistently right. When we accept those answers without independently trying to ascertain the theories that might animate them, we accrue intellectual debt’ (J Zittrain, ‘Intellectual Debt. With Great Power Comes Great Ignorance’, 178).

The point here is that, before relying on AI, we need to understand its workings and, more importantly, the underlying theories. In the case of AI for anti-corruption purposes, we should pay particular attention to the way corruption is conceptualised and embedded in the model.

Feeding the machine a corruption logic

In the paper, the model is developed and trained to translate ‘all the public contracts awarded in Spain in the years 2020 and 2021 into a bi-dimensional map with five different groups. These groups summarize the position of a contract in the network and their interactions with their awarded companies and public contracting bodies’ (at 14). Then, the crucial point from the perspective of a corruption logic comes in:

‘To determine the different profiles of the created groups in terms of corruption risk, news about bad practices or corruption scandals in public procurements in the same period (years 2020 and 2021) has been used as a reference. The news collection process has been manual and the 10 most important general information newspapers in Spain in terms of readership have been analyzed. Collected news about irregularities in public procurements identifies suspicions or ongoing investigations about one public contracting body and an awarded company. In these cases, all the contracts granted by the Public Administration to this company have been identified in the sample and flagged as “doubtful” contracts. The rest of the contracts, which means contracts without apparent irregularities or not uncovered yet, have been flagged as “normal” contracts. A total of 765 contracts are categorized as “doubtful”, representing 0.36% of total contracts … contracts belong to only 25 different companies, where only one company collects 508 granted contracts classified as “doubtful”’ (at 14-15, references omitted and emphasis added).

A sound logic?

This reflects a rather cavalier attitude to the absence of reliable corruption data and to difficulties in labelling datasets for that purpose [for discussion, again, see A Sanchez-Graells, ‘Procurement Corruption and Artificial Intelligence: Between the Potential of Enabling Data Architectures and the Constraints of Due Process Requirements’].

Beyond the data issue, this approach also reflects a questionable understanding of the mechanics of corruption. Even without getting into much detail, or trying to be exhaustive, it seems that this is a rather peculiar approach, perhaps rooted in a rather simplistic intuition of how tenderer-led corruption (such as bribery) could work. It seems to me to have some rather obvious shortcomings.

First, it treats companies as either entirely corrupt or not at all corrupt, whereas it seems plausible that corrupt companies will not necessarily always engage in corruption for every contract. Second, it treats the public buyer as a passive agent that ‘suffers’ the corruption and never seeks, or facilitates it. There does not seem to be any consideration to the idea that a public buyer that has been embroiled in a scandal with a given tenderer may also be suspicious of corruption more generally, and worth looking into. Third, in both cases, it treats institutions as monolithic. This is particularly problematic when it comes to treating the ‘public administration’ as a single entity, specially in an institutional context of multi-level territorial governance such as the Spanish one—with eg potentially very different propensities to corruption in different regions and in relation to different (local/not) players. Fourth, the approach is also monolithic in failing to incorporate the fact that there can be corrupt individuals within organisations and that the participation of different decision-makers in different procedures can be relevant. This can be particularly important in big, diversified companies, where a corrupt branch may have no influence on the behaviour of other branches (or even keep its corruption secret from other branches for rather obvious reasons).

If AI had been used to establish this approach to the identification of potentially corrupt procurement awards, the discussion would need to go on to scrutinise how a model was constructed to generate this hypothesis or insight (or the related dataset). However, in the paper, this approach to ‘conceptualising’ or ‘labelling corruption’ is not supported by machine learning at all, but rather depends on the manual analysis and categorisation of news pieces that are unavoidably unreliable in terms of establishing the existence of corruption, as eg the generation of the ‘scandals’ and the related news reporting is itself affected by a myriad of issues. At best, the approach would be suitable to identify the types of contracts or procurement agents most likely to attract corruption allegations and to have those reported in the media. And perhaps not even that. Of course, the labelling of ‘normal’ for contracts not having attracted such media attention is also problematic.

Final thoughts

All of this shows that we need to scrutinise ‘new approaches’ to the algorithmic detection of corruption (or any other function in procurement governance and more generally) rather carefully. This not only relates to the algorithms and the related assumptions of how socio-technical processes work, but also about the broader institutional and information setting in which they are developed (for related discussion, see here). Of course, this is in part a call for more collaboration between ‘technologists’ (such as data scientist or machine learning engineers) and domain experts. But it is also a call for all scholars and policy-makers to engage in critical assessment of logic or assumptions that can be buried in technical analysis or explanations and, as such, difficult to access. Only robust scrutiny of these issues can avoid incurring massive intellectual debt and, perhaps what could be worse, pinning our hopes of improved digital procurement governance on faulty tools.

_____________

* The reference to magic in the title and the introduction relates to Zittrain’s Magic-8 ball metaphor, but also his reference to the earlier observation by Arthur C. Clarke that any sufficiently advanced technology is indistinguishable from magic.

Transatlantic efforts against bid rigging in procurement [free webinar alert]

Prof Chris Yukins and Michael Bowsher QC have put together an excellent webinar on the current approaches to detect and sanction bid rigging in procurement in the US and the EU, as well as the possible future approach the UK may take post-Brexit.

Among other things, the webinar will include discussion of the European Commission’s recent bid rigging exclusion guidance (for initial comments see here).

The webinar will take place on 2 June 3pm CET / 2pm GMT. All welcome. Further details and free registration here.

Cartels in public procurement: A short comment on Heimler's (2012) J Comp L & Econ 8(3): 1-14

Prof. Alberto Heimler has recently published the interesting piece 'Cartels in Public Procurement' (2012) J Comp L & Econ 8(3): 1-14 [available, but maybe for subscribers only, here]. In his paper, Prof. Heimler discusses the specific features of bid-rigging as a particularly stable instance of collusion and presents some proposals to reduce the administrative burden and increase the incentives for procurement officials to track potential instances of bid rigging and to report them to the competition authorities, even on the basis of a mere suspicion (ie without need to provide full proof of the infringement). 

The abstract of his piece shows these general ideas:
Public procurement markets differ from all others because quantities do not adjust with prices but are fixed by the bidding authority. As a result, there is a high incentive for organizing cartels (where the price elasticity of demand is zero below the base price) that are quite stable because there are no lasting benefits for cheaters. In such circumstances, leniency programs are unlikely to help discovering cartels. Since all public procurement cartels operate through some form of bid rotation, public procurement officials have all the information necessary to discover them (although they have to collect evidence on a number of bids), contrary to what happens in normal markets where customers are not aware of the existence of a cartel. However, in order to promote reporting, the structure of incentives has to change. For example, the money saved from a cartel should at least, in part, remain with the administration that helped discover it and the reporting official should reap a career benefit. In any case, competition authorities should create a channel of communication with public purchasers so that the public purchasers would know that informing the competition authority on any suspicion at bid rigging is easy and does not require them to provide full proof.
This 'mainstream' description of his paper is perfectly in line with most economic and legal scholarship in this field and his work is an interesting reminder of the need to increase the liaison between public procurement and competition authorities, as well as to create a set of incentives (or a dedicated position) for public buyers to act as competition watchdogs of sorts or, more generally, as competition advocates [along the same lines, see A Sanchez Graells, Public Procurement and the EU Competition Rules (Oxford, Hart Publishing, 2011) 385-389]. Moreover, Prof. Heimler offers a couple of interesting insights that should be taken into consideration in the design of effective public procurement systems against bid rigging.

On the one hand, Prof. Heimler clearly indicates the diverging financial interests in bidding rings as opposed to 'general' cartels, which make leniency programs (potentially) less effective in this type of market settings:
Contrary to what happens in normal markets, bid-rigging cartels are much more stable. While in normal markets, quantities and prices are found simultaneously, in bidding markets, quantities are set by the organizer of the bid and the bidding is just used to find the lowest price associated with those quantities. Bid riggers know that by reducing prices (with respect of the agreed ones), they do not achieve any increase in the quantities sold. Rather, they just increase their profit at the expense of competitors and, most importantly, only for one bid. Once there is defection for one bid, the cheater knows (because of the transparency rules in public procurement) that he will be discovered and competition will prevail for all future bids. As a result of these characteristics, partly structural and partly rule-based, the incentive to cheat in bid rigging is much less pronounced than in normal markets (where cheating can be kept secret, at least for some time) (p. 7).
On the other hand, the level of transparency that is structurally implicit in public procurement settings makes it much easier for (properly trained) procurement officials to detect instances of bid rigging and to react:
Contrary to normal cartels, where the participating firms agree on prices or on territories so that customers face an information gap with respect to competitive prices, bid rigging in public procurement requires that the participating firms agree on the bid participation strategy (who wins and at what price; who will participate today; and who wins and who participates in future bids). As a result, bid riggers leave a lot of evidence on the strategies pursued that a well-trained public administration official could indeed identify. As a result, while a public procurement cartel is stable on the supply side, it could be discovered by due diligence on the demand side. This is the opposite of what happens with private market cartels (p. 12).
However, Prof. Heimler also includes a couple of final recommendations to make bid rigging more difficult that, in my opinion, would raise more issues than they would solve. Indeed, he proposes that:
There are also some very important procedural and legal steps that should be taken to make bid rigging much more difficult.
The first is to centralize purchases (or make sure that bids are not made artificially too small so that the construction of a large infrastructure project cannot be easily divided up among all the firms in the industry). This way, the information on the different bids can be found within the same organization so that any irregularity across different bids can be more easily identified. Furthermore, a centralized purchasing agency can organize bids of higher value (purchasing for a number of administrations) so that bids would be more infrequent and bid-rigging agreements would be more difficult to maintain.
Also, the rules that favor small firms in their participation in tenders, in which individually they would not be able to participate because of their small size, should be made much more rigorous. In particular, temporary consortia should only be allowed if comprised by firms producing complementary goods or services, while simple horizontal consortia should be prohibited. In fact, temporary consortia between rivals are very often a tool for enforcing a cartel more so than a way to increase competition (p. 13, emphasis added).
In my opinion, the first objective (centralization of information to make detection easier) can be attained simply by improving reporting and analysis mechanisms (along the lines of Articles 83 to 87 in the 2011 proposal for new EU public procurement Directives, now significantly reduced), rather than by conducting centralized procurement (which can lead to market foreclosure and other knock-on effects that are detrimental in economic terms).

Regarding the second proposal, I do not see how restricting SME's participation through consortia (ie limiting participation to larger companies) would reduce rather than increase the likelihood of collusion--since it would be equivalent to creating an oligopolistic (sub)market for larger companies, to which those large(r) contracts would be reserved. 

Hence, I would strongly recommend not taking any of those two actions, at least until some further (empirical) research is conducted in this field.