Monthly archive: August 2016

Probability of finding true love

The concept of true love has been invented by poets and other exaggerators. Evolutionarily, the optimal strategy is to settle with a good enough partner, not to seek the best in the world. But suppose for the sake of argument that a person A’s true love is another person B who exists somewhere in the world. What is the probability that A meets B?

There is no a priori reason why A and B have to be from the same country, have similar wealth or political views. Isn’t that what poets would have us believe – that love knows no boundaries, blossoms in unlikely places, etc?

Given the 7 billion people in the world, what fraction of them does a given person meet per lifetime? That depends on what is meant by “meets”: seeing each other from a distance, walking past each other on the street, looking at each other, talking casually. Let’s take the cliché “love at first sight” literally and assume that meeting means looking at each other. A person looks at a different number of people per day depending on whether they live in a city or in the countryside. There is also repetition, i.e. seeing the same person multiple times. A guess at the average number of new people a person looks at per day is 100. This times 365 days times a 70-year lifespan is 2,555,000 people. Divide 7 billion by this and the odds of meeting one’s true love are about one in 2,700 per lifetime.

Some questionable assumptions went into this conclusion, for example that the true love could be of any gender or age and that the meeting rate is 100 per day. Restricting the set of candidates to a particular gender and age group proportionately lowers both the number of candidates met and the total number of candidates, so it leaves the conclusion unchanged.

Someone on the hunt for a partner may move to a big city, sign up for dating websites and thereby raise the meeting rate (raise number met while keeping total number constant), which would improve the odds. On the other hand, if recognizing one’s true love takes more than looking at them, e.g. a conversation, then the meeting rate could fall to less than one – how many new people per day do you have a conversation with?
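The back-of-the-envelope estimate can be redone for different meeting rates. The sketch below assumes, as the text does, that every person in the world is equally likely to be the true love. The rate of 100 per day is the guess from the text; the rates of 1000 and 1 per day are illustrative assumptions standing in for "big city plus dating websites" and "one new conversation per day".

```python
WORLD_POPULATION = 7_000_000_000  # figure used in the text
DAYS_PER_YEAR = 365
LIFESPAN_YEARS = 70


def people_met(new_people_per_day):
    """Number of distinct people met over a lifetime at the given daily rate."""
    return new_people_per_day * DAYS_PER_YEAR * LIFESPAN_YEARS


def odds_one_in(new_people_per_day):
    """Return N such that the chance of meeting one specific person is 1 in N."""
    return WORLD_POPULATION / people_met(new_people_per_day)


# 100 is the guess from the text; 1000 and 1 are hypothetical alternatives.
for rate in (100, 1000, 1):
    print(f"{rate:>5} new people/day -> about 1 in {odds_one_in(rate):,.0f}")
```

Because the odds scale inversely with the meeting rate, a tenfold increase in the rate improves the odds tenfold, and a drop to one new person per day makes them a hundred times worse than the baseline.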

Some people claim to have met their true love, at least in the hearing of their current partner. The fraction claiming this is larger than would be expected based on the calculations above. There may be cognitive dissonance at work (reinterpreting the facts so that one’s past decision looks correct). Or perhaps the perfect partner is with high probability from the same ethnic and socioeconomic background and the same high school class (this is called homophily in sociology). Then love blossoms in the most likely places.

Deflation of academic publications

The top journals publish a similar number of articles as decades ago, but there is a much larger number of researchers competing to get their work into a top journal. Correspondingly, it is more difficult over time to get a paper into a given journal. If articles are analogous to currency in the academic world, then this would be deflation: the value of the currency rises over time. If articles are like goods and services, but research effort is the currency that buys them, then there is inflation, because the amount of currency required to buy a given good rises.

The correct comparison between publications in different decades would take into account the increasing difficulty of publishing in a given journal. Instead of comparing papers in the top n journals, a better metric is papers in the top x percent of journals (accounting for the possibly expanding size of each journal). Similarly, being the number one researcher among a thousand in 1901 is less impressive than being the best among a million in 2001. Again the right comparison is by percentile rank, not by “top n” status.
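The difference between "top n" and "top x percent" can be made concrete with the numbers from the paragraph above. This is only a sketch: the function names are made up for illustration, and rank 1 is taken to mean the best researcher.

```python
def top_share(rank, population):
    """Fraction of the population ranked at or above `rank` (rank 1 is best)."""
    if not 1 <= rank <= population:
        raise ValueError("rank must be between 1 and population")
    return rank / population


def equivalent_rank(rank, old_population, new_population):
    """Rank in the new population that has the same percentile as `rank` in the old one."""
    return rank * new_population / old_population


# Number one among a thousand versus number one among a million.
print(top_share(1, 1_000))        # share of the field at or above rank 1 in 1901
print(top_share(1, 1_000_000))    # the same for 2001

# The 1901 number one sits at the same percentile as this rank in 2001.
print(equivalent_rank(1, 1_000, 1_000_000))
```

Under this comparison, being first among a thousand corresponds to being around the thousandth among a million, which is the correction a "top n" count leaves out.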

The norms and metrics in academia are largely made by senior, established researchers. If people do not completely account for the deflation, then the top academics benefit from the increasing difficulty of publishing in the top n journals combined with the metric that counts the top n, not the top x percent. The research of old academics that was published in the top n long ago looks the more impressive the more difficult it is nowadays to get a paper into the top n. Comparison by percentile rank would correct for this artificial advantage, so the established members of the profession would not seem as high-achieving relative to new entrants.

A similar change in difficulty has occurred in getting accepted as a student in the top n universities, or getting hired as faculty in these. The right comparison to the students or faculty decades ago would compare the top x percent of universities, with the appropriate correction if the universities have expanded their enrollment or number of jobs.

Choosing an anonymous username

If you want to stay anonymous, neither your username nor your password (nor anything else you enter) should contain information about your background. So the username and password should not be in Estonian, but in English or some other major language. They should not contain cultural references (kalevsson, forestbrother), but should instead come from, for example, mass-market computer games or films (raider, halfelven, di3hard).

Very intellectual usernames (morslongavitabrevis, lenfercestlesautres) should be avoided if you want to hide your educational background.

Choosing a name for a child

“Tell me a beautiful name for a boy!”

Uniqueness seems to be a good property for a name: the person does not get confused with anyone else in documents. Of course one cannot be sure that nobody will give their child exactly the same name in the future, but the probability of this can be reduced by choosing a rare first name. If the surname is unique, then with any first name the full name is the only one in the world at present and probably in the near future. With a common surname, choosing a unique name is difficult.

A name should be easy to write down, pronounce and remember. Simplicity can largely be measured by the number of letters in the name. For boys, the simplest are probably three-letter names (Enn, Ain, Uku), followed by four-letter ones (Tiit, Olev, Ahto, Ants), and so on. For girls there is also Ly, although it is foreign; otherwise the three-letter Ela, Tea, Eve, Anu, Lea. A short name also does not need to be shortened into a nickname.

To reduce bureaucratic problems when travelling, one should avoid the country-specific letters õ, ä, ö, ü. Special characters are also a problem when the name is entered into all kinds of databases. Among Estonian names, Õie, Krõõt, Äili, Ülo, Väino and Pärt are sure to cause problems in foreign languages.
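A candidate name can be screened for such letters mechanically. The following is a minimal sketch that flags every character outside plain ASCII; treating "not ASCII" as the test for "likely to cause trouble abroad" is an assumption made here for illustration, and the function name is hypothetical.

```python
import unicodedata


def non_ascii_letters(name):
    """Return the characters of `name` that fall outside plain ASCII."""
    # Normalise first so that a letter typed as base letter plus combining
    # mark is treated the same as the precomposed letter.
    composed = unicodedata.normalize("NFC", name)
    return [ch for ch in composed if not ch.isascii()]


# The first six names are the examples from the text; Enn and Uku are controls.
for name in ["Õie", "Krõõt", "Äili", "Ülo", "Väino", "Pärt", "Enn", "Uku"]:
    flagged = non_ascii_letters(name)
    status = "problem letters: " + ", ".join(flagged) if flagged else "plain ASCII"
    print(f"{name}: {status}")
```

The same check combined with `len(name)` would also give the letter-count measure of simplicity mentioned earlier.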

A name should sound nice when spoken, especially for girls. The sound of course depends on the speaker's language (Euridice is pronounced Euridiitse in Estonian and Juridiss in English), so one should consider the language in which the name will probably be said most often. To me, the most beautiful-sounding words are those with many e, i and l sounds (which is why I consider Estonian very beautiful), so among girls' names, for example, I would prefer Ele to Anu and Ellen to Krõõt.

It would be good if the name did not mean anything bad, especially in a language with many speakers. For example, Reet is said to mean buttocks in Dutch. This can be hard to avoid, because there are many languages in the world and slang changes constantly, so in the future any name may become an obscenity in some language.

The meaning of the name and of words close to it should be something positive and suited to the child's sex, so that being mocked at school is less likely. Such names are, for example, Õie, Lembi, Helle, Aasa for girls and Kalev, Mehka, Mehis, Karmo, Tarmo, Ott for boys. In terms of meaning, I do not like Aita and Leili among women's names, or Mats among men's names. Suitability for the sex depends on the language: Eli can be a woman's name in Estonian, but it is a Jewish man's name.

On photos at tourist attractions

At every tourist attraction, there are numerous people taking pictures of the attraction, themselves and their companions. The same photos have been taken thousands of times before and are available on the internet. It would save people a lot of time overall if someone wrote a computer program that photoshops a person or group into these pictures. Basically, pick a location of which there are photos available online and load some pictures of yourself into the program, which returns photos of you at this place. With this, everyone can skip the photoshoot at the tourist sites, save money on the camera (or camera phone) and still obtain all the generic tourist photos they would have had under the current system.

The next step for attractions that consist of sight and sound only is to experience them through virtual reality goggles instead of actually going there. It is more environmentally friendly, safer and cheaper this way. Most tourist attractions fall into the visual-auditory category, e.g. architecture, museums, monuments, some of nature tourism.

Technological advances are required before tourist attractions that rely on smell, taste or touch (physically doing something, e.g. surfing) are replaced with virtual reality.

Creating art with a computer

There are three levels of creating art with a computer: testing, semi-automatic generation and fully automatic generation.

For testing, a human creates a unit of potential art, and a program compares it with a database of existing art and reports how much and where the submitted unit differs from the database average in certain parameters (for a book, e.g. word length, word frequency, sentence length and sentence type).
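For text, the testing level could look roughly like the sketch below. It covers only two of the parameters mentioned, mean word length and mean sentence length, and the "database" is simply a list of strings; the function names, the crude sentence splitting on ., ! and ?, and the assumption that every text is non-empty are all illustrative choices rather than a worked-out method.

```python
import re
from statistics import mean


def text_stats(text):
    """Compute simple style parameters for one non-empty text."""
    # Crude sentence split on ., ! and ?; real texts would need more care.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"\w+", text)
    return {
        "mean_word_length": mean(len(w) for w in words),
        "mean_sentence_length": len(words) / len(sentences),
    }


def deviation_from_corpus(candidate, corpus):
    """Difference between the candidate's parameters and the corpus averages."""
    corpus_stats = [text_stats(t) for t in corpus]
    cand = text_stats(candidate)
    return {key: cand[key] - mean(s[key] for s in corpus_stats) for key in cand}


# Hypothetical miniature database and submission.
corpus = ["The sea was calm. The night was long.", "He walked home. It rained."]
candidate = "Incomprehensible circumstances notwithstanding, she persevered."
print(deviation_from_corpus(candidate, corpus))
```

Positive values mean the submission has longer words or sentences than the database average, negative values shorter ones.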

Semi-automatic generation is when a program creates elements of an art form and a human selects some of them and assembles them. For example, the computer may generate lines of verse with a given syllable count and rhyme scheme. The human picks some of them and puts them together into a poem. The computer may first compare the generated elements with the works in the database to avoid plagiarism.

Fully automatic generation is when the computer creates a finished unit of art. For example, the computer takes a database of paintings and uses image recognition algorithms to separate out the elements of the paintings. Then the computer combines elements of paintings, selected in some way, into one picture and gives that picture a unified style, e.g. by imitating brushstrokes or distorting the images in a certain way.

Today, computers with the right program could fully automatically generate poems, puns, paintings, sculptures (by 3D printing) and pieces of music. Semi-automatically, one could additionally generate plot outlines for plays, books and films. Based on these outlines, a human can then create the corresponding unit of art, adding descriptions and details to the summary.

Given sufficient computing power, almost all art forms could be tested. A performing art would have to be filmed from several angles so that the computer could compare it in 3D. All of this requires databases of prior art.