Autoriarhiiv: sanhei

Computer vision training sets of photos are endogenous

In principle, every pixel could be independent of any other, so the set of possible photos is the number of pixels times the number of colours – billions at least. No training data set is large enough to cover these photo possibilities many times over, as required for statistical analysis, of which machine learning is a subfield. The problem is solved by restricting attention to a small subset of possible photos. In this case, there is a reasonable number of possible photos, which can be covered by a reasonably large training data set.

Useful photos on any topic usually contain just one main object, such as a face, with less than 100 secondary objects (furniture, clothes, equipment). There is a long right tail – some useful photos have dozens of the main object, like a group photo full of faces, but I do not know of a photo with a thousand distinguishable faces. Photos of mass events may have ten thousand people, but lack the resolution to make any face in these useful.

Only selected photos are worth analysing. Only photos sufficiently similar to these are worth putting in a computer vision training dataset. The sample selection occurs both on the input and the output side: few of the billions of pixel arrangements actually occur as photos to be classified by machine vision and most of the training photos are similar to those. There are thus fewer outputs to predict than would be generated from a uniform random distribution and more inputs close to those outputs than would occur if input data was uniform random. Both speed learning.

When photo resolution improves, more objects of interest may appear in photos without losing usefulness to blur. Then such photos become available in large numbers and are added to the datasets.

Turumanipulatsioon Funderbeamis iseendale müües

Lisa kommentaar

Idufirmade kapitalikaasamise ja nende osakutega kauplemise platvormil Funderbeam pole internetiotsingu kohaselt kümne aasta jooksul ükski idufirma edukalt väljunud (börsile läinud, kellegi poolt ära ostetud), kuigi Funderbeami hinnagraafikute kohaselt on mõnede idufirmade osakute hinnad kümneid kordi tõusnud. Tekib küsimus, kas Funderbeamis näidatavad graafikud ikka peegeldavad tegelikku turgu.

Üks võimalus, kuidas näiteks idufirma X omanik võib selle osaku hinda Funderbeami (ja muude ühisrahastusplatvormide) järelturul üles pumbata, on müüa iseenda muude firmade Y ja Z vahel idufirma X osakut kõrge hinnaga (Y ja Z on Funderbeami investorid, aga X Funderbeamis kaubeldav ettevõte). Selline turumanipulatsioon iseendalt ostmise abil on võimalik, sest erinevalt börsist, kus tehing sõlmitakse anonüümselt parima hinna pakkujaga, saab Funderbeami järelturul valida, kellega tehingut teha. Seega saab vastaspooleks valida endaga seotud kaupleja. Siis pole karta raha kaotamist, ostes osakuid kõrge hinnaga, sest ostetakse iseendalt.

Sõnades Funderbeam muidugi keelab turumanipulatsiooni, aga sõnades on neil ka palju teisi reegleid, mida nad ei jõusta, näiteks idufirmade kohustus iga kvartali aruanne esitada. Funderbeami esindaja isegi tunnistas, et nad ei tee muud kui meeldetuletuste saatmine kui kaubeldavad idufirmad aruandekohustust ei täida.

Kui idufirma osaku hinnagraafik joonistatakse lihtsalt viimaste tehingute keskmise hinna põhjal, siis on hinnapumpamine eriti lihtne, sest piisab ühe osaku edasi-tagasi kauplemisest piisav arv kordi enda valitud hinnaga enda firmade Y ja Z vahel. Isegi kui Funderbeam joonistab hinnagraafiku tehingute väärtustega kaalutud keskmise põhjal, saab hinna üles ajada – peab vaid ajaühikus suurt mahtu enda firmade vahel liigutama. Selleks ei pea ise palju osakuid omama, lihtsalt peab palju edasi-tagasi tehinguid tegema. Kauplemistasu ju ei ole, tehingud on tasuta, nii et turumanipulatsioon ka.

Kui keegi teine teeb madala hinnaga tehingu ja seega osaku hinnagraafiku alla tõmbab, võib manipuleerija arvutiprogramm selle kiirelt tuvastada ja teha palju kõrge hinnaga tehinguid Y ja Z vahel. See lööb hinna jälle üles.

Funderbeamil on motiiv sellist hinnapumpamist lubada (ma kindlasti ei väida, et nad seda tegelikult lubavad või et keegi seda teeb), sest nii tundub uutele investoritele, et Funderbeamis kaubeldavate idufirmade osakute väärtused tõusevad, mis meelitab investoreid platvormile. Funderbeam ei võta investoritelt tasu, küll aga idufirmadelt, kelle osakuid noteerib. Idufirmadele meeldib platvorm, kus on palju investoreid, kes nende osakuid esmaemissioonil ostavad. Konkurents investorite vahel laseb idufirmal osaku eest kõrgemat hinda küsida, ehk kaasata rohkem kapitali või müüa sama raha eest väiksema osa ettevõttest. Kaotajaks jäävad investorid, kes hinnagraafikul olevast numbrist isegi tunduvalt odavamalt oma osakuid müüa ei suuda. Millegipärast ostetakse vahel järelturul kõrge hinnaga pakutavaid osakuid enne madalamaid.

Üks seletus on, et kuna pakutava müügiorderi peab ostma täies mahus, siis ostja valib endale sobiva koguse isegi kui peab selle eest kõrgemat ühikuhinda maksma. Suure hinnavahe korral ei tundu see tõenäoline. Samuti saab foorumisse kirjutada soovi teatud kogust osta koos palvega, et osaku müüjad oma suure orderi väiksemateks tükkideks jagaksid.

Hinnapumpamist takistaks börsisarnane „parima hinnaga tehingu” reegel, et tehing sõlmitakse turu vastaspoolel parimat hinda pakkuva kauplejaga. Kui keegi tahaks kõrge hinnaga iseendale müüa, peaks ta nii ostu- kui müügiorderi börsile edastama. Kõrge hinnaga müügiorderit ei võtaks teised kauplejad vastu, aga ostuorder täidetaks teiste kauplejate müügipakkumisega, mille hinnad on madalamad. Nii peaks manipuleerija ostma kõik pakutavad müügiorderid kuni tema soovitud hinnatasemeni, et hinda sellele tasemele pumbata. Need ostud läheksid väga kulukaks ja koormaksid manipuleerijat suure koguse madala väärtusega osakutega, mille eest ta kõrget hinda maksis.

Teisest mullipumpamise taktikast ühisrahastuses olen kirjutanud postituses Overbidding incentives in crowdfunding, mis räägib esmaemissiooni, mitte järelturu manipulatsioonist.

Talgud kui laenuvõtt

Lisa kommentaar

Külaühiskonnas püstitati uus hoone tihti talgukorras. Naiivselt võiks arvata, et teised külaelanikud tegid ühele perele heateo, ehitades neile tasuta maja. Tegelikult eeldasid ehitustalgulised, et kasu saanud pere osaleb edaspidi ise talgutel, kus midagi ehitajate kasuks tehakse, ehk maksab talguliste algse töö oma hilisema tööga tagasi. Sisuliselt oli tegu tööjõu laenamisega. Kui laenu saanud pere oleks keeldunud edaspidi teistele talgutööd tegemast, oleks küla neid karistanud vähemalt sotsiaalse halvakspanu ja verbaalse vägivallaga, ilmselt ka keeldunud tööriistu või näljaajal toitu laenamast.

Laen loob väärtust kui laenatavat vara saab kohe kasutama hakata, selle asemel, et vara ostmiseks säästa, lükates kasutamise algust edasi kuni piisav summa koguneb. Ühel taluperel kuluks hoone püstitamiseks mitu kuud, mille jooksul peaksid nad elama ajutises onnis, mille peaks omakorda enne ehitama. Talguliste tööd laenates saavad nad paari päevaga sisse kolida ja onni ehituskulu jääb ära. Majas on kütmine, söögitegemine ja toidu säilitamine efektiivsem kui onnis.

Tänapäeval on majalaenul sarnane eesmärk kui põllumajandusühiskonnas talgutöö laenamisel. Selle asemel, et 25 aastat üürikorteris sääste koguda ja elu lõpus maja osta, võib laenata, kohe majja kolida ja seal elades 25-30 aastat laenu tagasi maksta. Intressi tõttu on tagasimaksed kokku küll suuremad kui algne laen, aga ometi on laen kasulik, sest heaolu suuremast ja püsivamast eluasemest ületab intressikulu.

Jõusaalis käimine vs treenimine

Lisa kommentaar

Tehkem vahet jõusaalis käimise ja jõusaalis treenimise vahel. Nende jaoks, kes jõusaalimasinatel istuvad ja telefoni näpivad või omavahel juttu ajavad, võiks paigaldada seina äärde istepingid. Nad saavad seal istuda ja pärast ausalt väita, et käisid jõusaalis – kõndisid pingini ja tagasi. Kõik võidavad, sest pingid on istumiseks mugavamad kui masinad, mis omakorda vabanevad trennitegijatele.

Jõusaalimasinatele võiks paigaldada sellised elektrilised seadmed, et kui keegi masinal istub, aga seda ei liiguta (trenni ei tee), siis saab särtsu tagumikku. See motiveeriks jõusaalis käijaid/istujaid masinat vabastama, et teised saaksid seda kasutada.

Teksti ja kõne automaatse teisendamise treeningandmed

Lisa kommentaar

Masinõppeks on vaja suurt andmehulka. Vähese kõnelejate arvuga keele puhul on kallis palgata inimesi arvutile tekste ette lugema, et arvuti õpiks teksti kõneks teisendama. Samuti on kallis kõne kirjapanijaid palgata, et arvuti õpiks kõnet tekstiks tõlkima. Peaks leidma juba nii kõnes kui kirjas olemas olevaid tekste, mida masinõppele sööta. Üks näide on raadiointervjuud, mis ka ajalehes avaldatakse (Kuku Raadio saade Restart Postimehe majanduslehekülgedel, Toomas Sildami intervjuud). Probleemiks on suulise intervjuu toimetamine enne kirjalikku avaldamist, mis tähendab, et kõne ja tekst ei ole üksüheses vastavuses. Natuke kasu sellest siiski arvutile on, sest osad laused korduvad kõnes ja kirjas. Kõnenäidiseid ilma kirjaliku vasteta saab audiovisuaalsest ajakirjandusest hulganisti, samuti kirjalikku teksti helilise vasteta trükiajakirjandusest.

Kui koolides on tehtud etteütlusi arvutisse trükituna ja nende suuline salvestis on ka saadaval, siis saab neid arvuti treenimiseks kasutada.

Näidendite ja filmide tekstid ja helisalvestised on samuti vastavuses kõne ja kiri. Samuti laulusõnad ja laulud, aga laulu hääldus ja rütm on tavakõnest erinev, nii et see ei pruugi arvutit aidata muus kui laulude tõlkimisel heli ja kirja vahel.

Audioraamatud, mille puhul ettelugemine vastab kirjapandule, on head pikad sama häälega loetud tekstid arvuti treenimiseks.

Maailma suuremate keelte edukamad masintõlkefirmad on ilmselt leidnud palju teisi treeningandmete hulki, mida oma programmide arendamiseks kasutada.

Moon phase and sleep correlation is not quite a sine wave

Lisa kommentaar

Casiraghi et al. (2021) in Science Advances (DOI: 10.1126/sciadv.abe0465) show that human sleep duration and onset depends on the phase of the moon. Their interpretation is that light availability during the night caused humans to adapt their sleep over evolutionary time. Casiraghi et al. fit a sine curve to both sleep duration and onset as functions of the day in the monthly lunar cycle, but their Figure 1 A, B for the full sample and the blue and orange curves for the rural groups in Figure 1 C, D show a statistically significant deviation from a sine function. Instead of same-sized symmetric peaks and troughs, sleep duration has two peaks with a small trough between, then a large sharp trough which falls more steeply than rises, then two peaks again. Sleep onset has a vertically reflected version of this pattern. These features are statistically significant, based on the confidence bands Casiraghi and coauthors have drawn in Figure 1.

The significant departure of sleep patterns from a sine wave calls into question the interpretation that light availability over evolutionary time caused these patterns. What fits the interpretation of Casiraghi et al. is that sleep duration is shortest right before full moon, but what does not fit is that the duration is longest right after full and new moons, but shorter during a waning crescent moon between these.

It would better summarise the data to use the first four terms of a Fourier series instead of just the first term. There seems little danger of overfitting, given N=69 and t>60.

A questionable choice of the authors is to plot the sleep duration and onset of only the 35 best-fitting participants in Figure 2. A more honest choice yielding the same number of plots would pick every other participant in the ranking from the best fit to the worst.

In the section Materials and Methods, Casiraghi et al. fitted both a 15-day and a 30-day cycle to test for the effect of the Moon’s gravitational pull on sleep. The 15-day component was weaker in urban communities than rural, but any effect of gravity should be the same in both. By contrast, the effect of moonlight should be weaker in urban communities, but the urban community data (Figure 1 C, D green curve) fits a simple sine curve better than rural. It seems strange that sleep in urban communities would correlate more strongly with the amount of moonlight, like Figure 1 shows.

Clinical trials of other drugs in other species to predict a drug’s effect in humans

Lisa kommentaar

Suppose we want to know whether a drug is safe or effective for humans, but do not have data on what it does in humans, only on its effects in mice, rats, rhesus macaques and chimpanzees. In general, we can predict the effect of the drug on humans better with the animal data than without it. Information on “nearby” realisations of a random variable (effect of the drug) helps predict the realisation we are interested in. The method should weight nearby observations more than observations further away when predicting. For example, if the drug has a positive effect in animals, then predicts a positive effect in humans, and the larger the effect in animals, the greater the predicted effect in humans.

A limitation of weighting is that it does not take into account the slope of the effect when moving from further observations to nearer. For example, a very large effect of the drug in mice and rats but a small effect in macaques and chimpanzees predicts the same effect in humans as a small effect in rodents and a large one in monkeys and apes, if the weighted average effect across animals is the same in both cases. However, intuitively, the first case should have a smaller predicted effect in humans than the second, because moving to animals more similar to humans, the effect becomes smaller in the first case but larger in the second. The idea is similar to a proportional integral-derivative (PID) controller in engineering.

The slope of the effect of the drug is extra information that increases the predictive power of the method if the assumption that the similarity of effects decreases in genetic distance holds. Of course, if this assumption fails in the data, then imposing it may result in bias.

Assumptions may be imposed on the method using constrained estimation. One constraint is the monotonicity of the effect in some measure of distance between observations. The method may allow for varying weights by adding interaction terms (e.g., the effect of the drug times genetic similarity). The interaction terms unfortunately require more data to estimate.

Extraneous information about the slope of the effect helps justify the constraints and reduces the need for adding interaction terms, thus decreases the data requirement. An example of such extra information is whether the effects of other drugs that have been tested in these animals as well as humans were monotone in genetic distance. Using information about these other drugs imposes the assumption that the slopes of the effects of different drugs are similar. The similarity of the slopes should intuitively depend on the chemical similarity of the drugs, with more distant drugs having more different profiles of effects across animals.

The similarity of species in terms of the effects drugs have on them need not correspond to genetic similarity or the closeness of any other observable characteristic of these organisms, although often these similarities are similar. The similarity of interest is how similar the effects of the drug are across these species. Estimating this similarity based on the similarity of other drugs across these animals may also be done by a weighted regression, perhaps with constraints or added interaction terms. More power for the estimation may be obtained from simultaneous estimation of the drug-effect-similarity of the species and the effect of the drug in humans. An analogy is demand and supply estimation in industrial organisation where observations about each side of the market give information about the other side. Another analogy is duality in mathematics, in this case between the drug-effect-similarity of the species and the given drug’s similarity of effects across these species.

The similarity of drugs in terms of their effects on each species need not correspond to chemical similarity, although it often does. The similarity of interest for the drugs is how similar their effects are in humans, and also in other species.

The inputs into the joint estimation of drug similarity, species similarity and the effect of the given drug in humans are the genetic similarity of the species, the chemical similarity of the drugs and the effects for all drug-species pairs that have been tested. In the matrix where the rows are the drugs and the columns the species, we are interested in filling in the cell in the row “drug of interest” and the column “human”. The values in all the other cells are informative about this cell. In other words, there is a benefit from filling in these other cells of the matrix.

Given the duality of drugs and species in the drug effect matrix, there is information to be gained from running clinical trials of chemically similar human-use-approved drugs in species in which the drug of interest has been tested but the chemically similar ones have not. The information is directly about the drug-effect-similarity of these species to humans, which indirectly helps predict the effect of the drug of interest in humans from the effects of it in other species. In summary, testing other drugs in other species is informative about what a given drug does in humans. Adapting methods from supply and demand estimation, or otherwise combining all the data in a principled theoretical framework, may increase the information gain from these other clinical trials.

Extending the reasoning, each (species, drug) pair has some unknown similarity to the (human, drug of interest) pair. A weighted method to predict the effect in the (human, drug of interest) pair may gain power from constraints that the similarity of different (species, drug) pairs increases in the genetic closeness of the species and the chemical closeness of the drugs.

Define Y_{sd} as the effect of drug d in species s. Define X_{si} as the observable characteristic (gene) i of species s. Define X_{dj} as the observable characteristic (chemical property) j of drug d. The simplest method is to regress Y_{sd} on all the X_{si} and X_{dj} and use the coefficients to predict the Y_{sd} of the (human, drug of interest) pair. If there are many characteristics i and j and few observations Y_{sd}, then variable selection or regularisation is needed. Constraints may be imposed, like X_{si}=X_i for all s and X_{dj}=X_j for all d.

Fused LASSO (least absolute shrinkage and selection operator), clustered LASSO and prior LASSO seem related to the above method.

Sõidueksami ajad kiirkorras suurema riigilõivu eest

Lisa kommentaar

Transpordiamet võiks pakkuda sõidueksami aega ka kiirkorras suurema riigilõivu eest nagu PPA pakub passi kättesaamist (https://www.politsei.ee/et/juhend/eesti-passi-taotlemine-taeiskasvanule/taotlemine-iseteeninduses: passi riigilõiv on iseteeninduses taotlemisel 35 €. …kiirkorras on riigilõiv 58 €.). Praegu ostavad-müüvad eksamitegijad omavahel varasemaid aegu, näiteks https://www.facebook.com/groups/arkeksamiteajaduleeesti ja maksavad isegi vabade eksamiaegade teavituste eest 10-25 € (https://eksamiajad.ee/).

Aegade eraviisiline kauplemine tekitab motiivi hangeldamiseks: panna aeg kinni, et seda kellelegi edasi müüa. Kui ei õnnestu müüa, siis lükata aeg edasi hilisemaks enne kui tühistamise eest uuesti lõivu peab maksma. Hangeldamise tagajärjel jääb see aeg suurema tõenäosusega kasutamata, sest teised, kes seda aega vajaksid, aga osta ei taha või ostuvõimalusest õigel ajal ei tea, ei registreeru sellele edasilükkamisel vabanenud ajale. Transpordiametil jääb see aeg tühjaks, järjekorrad pikenevad, hilised edasilükkamised raskendavad graafiku planeerimist nii Transpordiametile kui eksamitegijatele.

Kui Transpordiamet ise ei saa kõrgema lõivu eest kiirkorras aega pakkuda, võiks selleks ministeeriumist luba küsida. Vajadusel küsida, et ministeerium vastavaid määrusi muudaks.

Kiirkorras aegade pakkumisega Transpordiamet sisuliselt konkureeriks hangeldajatega. Kui hangeldamine tekitab piisavalt suure järjekordade pikenemise, siis teeks Transpordiameti konkurents aegade jaotuse efektiivsemaks.

Transpordiamet saaks oma andmete põhjal teatud täpsusega kontrollida, kes aegadega hangeldab. Kui keegi lükkab sõidueksami aega korduvalt edasi just enne kui tal tekiks uue riigilõivu maksmise kohustus, siis tõenäoliselt üritas ta aega müüa, aga ei leidnud ostjat. Kui inimene lükkab lihtsalt aega mitu korda edasi ja sama aja paneb iga kord kiiresti peale selle vabanemist kinni keegi teine, siis tõenäoliselt müüs inimene aja iga kord kellelegi teisele.

Transpordiameti vastus

Teie ettepanek on huvitav, paraku juba läbi arutatud ja kõlbmatuks tunnistatud. Põhjusi on mitu:

Eetiline. Selline süsteem looks eelise neile, kes on nõus rohkem maksma. Transpordiamet on teinud mitmeid arendusi ja protseduurireeglite muutusi, et esimesele katsele tulev juhikandidaat oleks eelistatud järeleksamite ees. Lisaks asjaolu, et kiirpassi taotlustest rahuldatakse valdav enamus, riiklike sõidueksamite sooritus on kuskil 50% peal. See tekitaks küsitavusi protsessi läbipaistvuse osas. Üheks korruptsiooni tajumise parameetriks on just nimelt see, et protsessid on läbipaistvad ja võimalikult ühetaolised kõigile klientidele.
Majanduspoliitiline. Transpordiamet on täidesaatva võimu asutus, mitte tulu teeniv organisatsioon. Riigilõivu mõte on katta tehtud kulutusi, mitte teenida kasumit. Suurem riigilõiv eeldaks Transpordiametilt ka täiendavaid kulutusi, näiteks kallimat eksamisõidukit. Nn kiirpassi puhul võib pealiskaudsel mõtisklemisel leida teatavaid täiendavaid kulutusi – trükikoja koormuse kasv, menetleva ametniku koormuise kasv, täiendavad kulleriteenused trükikoja ja PPA teenindusbüroo vahel.
Lõpetuseks sooviks Teid ikkagi tänada huvitava mõttearenduse eest.
Parimate soovidega
[nimi eemaldatud]

Minu vastuväited 26.01.2021

Transpordiameti kumbki põhjendus ei ole veenev.

1) Eks igaühel on erinev maitse eetika osas. Praegune süsteem, kus eksamiaegadega kaubeldakse, loob veel suurema eelise neile, kes on nõus rohkem maksma, kui kiirema pääsu eest kõrgema riigilõivu küsimine. Lühemad järjekorrad vähendaksid rohkem maksjate eelist. Üks viis järjekordi lühendada on kui Transpordiamet konkureerib aegadega kauplejatega.

Sama läbipaistev kui kiirpassi süsteem oleks, et kes maksab suurema riigilõivu, saab kiiremini eksamile, aga kindlasti mitte lihtsama eksami, nagu ka passile õigust mitteomav isik ei saa suurema lõivu eest passi. Seega eksami läbimise protsent ei puutu asjasse. Mingeid küsitavusi läbipaistvuse osas ma ei näe.

2) Kui Transpordiamet ei soovi suurema riigilõivuga tulu teenida, võib ta selle annetada riigituludesse või heategevuseks. Võib ka selle arvelt alandada tavajärjekorras eksamilepääsejate riigilõivu, mis oleks praegune süsteem (kiirem pääseja maksab aeglasemale aja eest) Transpordiameti kaudu, aga ametlikult ja efektiivsemalt.

Transpordiametil pole põhjust suuremaid kulutusi teha.

Kiirpassil tunduvad olevat täiendavad kulutused ainult pealiskaudsel mõtisklemisel. Lõpuks antakse välja ikka sama arv passe ja nende tootmise ja menetlemise kulu on sama. Võibolla kullerteenus lisab pisut hinda, aga kindlasti mitte riigilõivude vahe jagu. Korraga saadetakse ju PPA büroosse palju passe, nii et saatmiskulu jaguneb nende vahel. Suurema riigilõivu maksjad pannakse lihtsalt järjekorra etteotsa.

Transpordiamet võib isegi lühemate järjekordadega kulutusi kokku hoida. Kui ooteaeg on lühem, siis on eksamieelset unustamist vähem, läbikukkumisi sama eksamiraskusega vähem. Keskmine eksamitegija teeb siis väiksema arvu eksameid, mis vähendab Transpordiameti kulu.

Lõpuks soovin tänada Transpordiametit kiire vastuse eest.

Leader turnover due to organisation performance is underestimated

Lisa kommentaar

Berry and Fowler (2021) “Leadership or luck? Randomization inference for leader effects in politics, business, and sports” in Science Advances propose a method they call RIFLE for testing the null hypothesis that leaders have no effect on organisation performance. The method is robust to serial correlation in outcomes and leaders, but not to endogenous leader turnover, as Berry and Fowler honestly point out. The endogeneity is that the organisation’s performance influences the probability that the leader is replaced (economic growth causes voters to keep a politician in office, losing games causes a team to replace its coach).

To test whether such endogeneity is a significant problem for their results, Berry and Fowler regress the turnover probability on various measures of organisational performance. They find small effects, but this underestimates the endogeneity problem, because Berry and Fowler use linear regression, forcing the effect of performance on turnover to be monotone and linear.

If leader turnover is increased by both success (get a better job elsewhere if the organisation performs well, so quit voluntarily) and failure (fired for the organisation’s bad performance), then the relationship between turnover and performance is U-shaped. Average leaders keep their jobs, bad and good ones transition elsewhere. This is related to the Peter Principle that an employee is promoted to her or his level of incompetence. A linear regression finds a near-zero effect of performance on turnover in this case even if the true effect is large. How close the regression coefficient is to zero depends on how symmetric the effects of good and bad performance on leader transition are, not how large these effects are.

The problem for the RIFLE method of Berry and Fowler is that the small apparent effect of organisation performance on leader turnover from OLS regression misses the endogeneity in leader transitions. Such endogeneity biases RIFLE, as Berry and Fowler admit in their paper.

The endogeneity may explain why Berry and Fowler find stronger leader effects in sports (coaches in various US sports) than in business (CEOs) and politics (mayors, governors, heads of government). A sports coach may experience more asymmetry in the transition probabilities for good and bad performance than a politician. For example, if the teams fire coaches after bad performance much more frequently than poach coaches from well-performing competing teams, then the effect of performance on turnover is close to monotone: bad performance causes firing. OLS discovers this monotone effect. On the other hand, if politicians move with equal likelihood after exceptionally good and bad performance of the administrative units they lead, then linear regression finds no effect of performance on turnover. This misses the bias in RIFLE, which without the bias might show a large leader effect in politics also.

The unreasonably large effect of governors on crime (the governor effect explains 18-20% of the variation in both property and violent crime) and the difference between the zero effect of mayors on crime and the large effect of governors that Berry and Fowler find makes me suspect something is wrong with that particular analysis in their paper. In a checks-and-balances system, the governor should not have that large of influence on the state’s crime. A mayor works more closely with the local police, so would be expected to have more influence on crime.

Reduce temptation by blocking images

Lisa kommentaar

Web shops try to tempt customers into unnecessary and even harmful purchases, including grocery and food ordering sites which promote unhealthy meals. The temptation can be reduced by blocking images on shopping websites. I find it useful when ordering food. Similarly, Facebook and news sites try to tempt viewers with clickbait and ads. To reduce my time-wasting, I make the clickbait less attractive by blocking images. The pictures in most news stories do not contribute any information – a story about a firm has a photo of the main building or logo of the firm or the face of its CEO, a “world leaders react to x” story has pictures of said leaders.

The blocking may require a browser extension (“block images”) and each browser and version has a little different steps for this.

In Chromium on 20 Jan 2021, no extension is needed:

1) click the three vertical dots at the top right,

2) click Settings to go to chrome://settings/,

3) scroll down to Site settings, click it,

4) scroll down to Images, click it.

5) Click the Add button to the right of the Block heading. A dialog pops up to enter a web address.

6) Copy the url of the site on which you want to block pictures, for example https://webshop.com into the Site field.

If seeing the images is necessary for some reason, then re-enable images on the website: follow steps 1-4 above, then click the three vertical dots under the Add button under the Block heading. A menu of three options pops up. Click the Allow option.

Alternatively, you may block all images on all websites and then allow only specific sites to show images. For this, follow steps 1-4 above, then click the blue button to the right of the Allow all (recommended) heading. Then click the Add button next to Allow. A dialog pops up to enter a web address. Copy the url of the site on which you want to block pictures, for example https://webshop.com into the Site field.