Artificial Intelligence & Photography

mickyates Generative, Graphic design, Manipulation, Photography

It is hard to consider any photograph completely devoid of human intervention. Images taken on Mars are only possible because humans created the recording and broadcasting equipment. Monkeys taking selfies … well, however you look at it, a human designed the camera and the capture setup.

Yet it is also clear that we are increasingly aided by technology. Cameras have long had programs of various descriptions. Image manipulation software, facial recognition and so forth muddy the waters of authorship.

It would appear to me, however, that only with the advent of Artificial Intelligence (AI) featuring deep machine learning could computers actually generate new imagery without human intervention. Yes, humans design the learning algorithms, and provide ‘seed’ imagery. But entirely new images and ways of learning are created.

The header, above, shows two such images – imaginary people that do not exist.

As the system learns, it can create new images, and search for new seeds over the internet, without human intervention. As is the case with AI language research, when computers communicate directly without human interaction, novel things happen. For example, they get aggressive.

In 2017, [Google DeepMind] researchers tested its willingness to cooperate with others, and revealed that when DeepMind feels like it’s about to lose, it opts for “highly aggressive” strategies to ensure that it comes out on top. The Google team ran 40 million turns of a simple ‘fruit gathering’ computer game that asks two DeepMind ‘agents’ to compete against each other to gather as many virtual apples as they could. They found that things went smoothly so long as there were enough apples to go around, but as soon as the apples began to dwindle, the two agents turned aggressive, using laser beams to knock each other out of the game to steal all the apples.

And remember the tabloid scare that ‘researchers switched off computers as they were afraid that they were learning their own language’? Not exactly true: Facebook was more interested in computers that could communicate with humans, hence the switch-off.

NVIDIA, amongst others, are creating GANs (Generative Adversarial Networks), which allow computers to self-train in image making.

Here is the abstract of a recent paper:

We describe a new training methodology for generative adversarial networks. The key idea is to grow both the generator and discriminator progressively: starting from a low resolution, we add new layers that model increasingly fine details as training progresses. This both speeds the training up and greatly stabilizes it, allowing us to produce images of unprecedented quality, e.g., CelebA images at 1024². We also propose a simple way to increase the variation in generated images, and achieve a record inception score of 8.80 in unsupervised CIFAR10.

Additionally, we describe several implementation details that are important for discouraging unhealthy competition between the generator and discriminator. Finally, we suggest a new metric for evaluating GAN results, both in terms of image quality and variation. As an additional contribution, we construct a higher-quality version of the CelebA dataset.

Note: CelebA is a huge dataset of human faces for researchers’ use in facial recognition, facial attribute recognition and so forth.

Original NVIDIA Video

Put in simpler words, the system uses competition to create new images.

A GAN consists of two parts, a discriminator and a generator. The former learns how to distinguish fake from real objects, and the latter creates new content and tries to fool the discriminator by developing novel images that it hasn’t seen before.
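To make the competition concrete, here is a minimal sketch in plain Python. It is not NVIDIA's method, and it works on single numbers rather than images: the ‘real’ data is assumed to be numbers clustered around 4, the generator is a straight line applied to random noise, and the discriminator is a single logistic unit. All the names (generate, discriminate, d_step, g_step, train) and all the settings are my own assumptions for illustration.

```python
import math
import random


def sigmoid(t):
    """Numerically safe logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def generate(a, b, z):
    """Toy generator: turns a noise value z into a 'fake' sample."""
    return a * z + b


def discriminate(w, c, x):
    """Toy discriminator: estimated probability that x is real."""
    return sigmoid(w * x + c)


def d_loss(w, c, real, fake):
    """Discriminator loss: low when real scores high and fake scores low."""
    loss_real = -sum(math.log(discriminate(w, c, x)) for x in real) / len(real)
    loss_fake = -sum(math.log(1.0 - discriminate(w, c, x)) for x in fake) / len(fake)
    return loss_real + loss_fake


def d_step(w, c, real, fake, lr):
    """One gradient step that makes the discriminator better at telling real from fake."""
    gw = gc = 0.0
    for x in real:
        p = discriminate(w, c, x)
        gw -= (1.0 - p) * x / len(real)
        gc -= (1.0 - p) / len(real)
    for x in fake:
        p = discriminate(w, c, x)
        gw += p * x / len(fake)
        gc += p / len(fake)
    return w - lr * gw, c - lr * gc


def g_step(a, b, w, c, noise, lr):
    """One gradient step that moves the generator toward what the discriminator rates as real."""
    ga = gb = 0.0
    for z in noise:
        p = discriminate(w, c, generate(a, b, z))
        ga -= (1.0 - p) * w * z / len(noise)
        gb -= (1.0 - p) * w / len(noise)
    return a - lr * ga, b - lr * gb


def train(steps=500, batch=64, lr=0.05, seed=0):
    """Alternate discriminator and generator updates on fresh batches."""
    rng = random.Random(seed)
    a, b, w, c = 1.0, 0.0, 0.0, 0.0
    for _ in range(steps):
        real = [rng.gauss(4.0, 0.5) for _ in range(batch)]  # assumed 'real' data
        noise = [rng.gauss(0.0, 1.0) for _ in range(batch)]
        fake = [generate(a, b, z) for z in noise]
        w, c = d_step(w, c, real, fake, lr)
        a, b = g_step(a, b, w, c, noise, lr)
    return a, b, w, c


if __name__ == "__main__":
    a, b, w, c = train()
    print("generator offset b after training:", round(b, 2))
```

The only point of the sketch is the alternation: the discriminator is nudged to separate real from fake, then the generator is nudged to fool it. Real GANs replace both toy functions with deep networks, and balancing the two sides is much harder than this example suggests.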

This is quite different to ‘traditional’ generative programs that create new images that are firmly based on human-defined rules – for example Fractals, based on the Mandelbrot Set.
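As a rough illustration of what ‘human-defined rules’ means here, the sketch below draws the Mandelbrot Set as text using the standard escape-time rule: repeat z = z² + c and see whether z runs away. The function names, the grid size and the characters used are my own choices for illustration.

```python
def escape_time(c, max_iter=50):
    """Count iterations of z = z*z + c before |z| exceeds 2 (max_iter if it never does)."""
    z = 0j
    for n in range(max_iter):
        if abs(z) > 2.0:
            return n
        z = z * z + c
    return max_iter


def render(width=60, height=21, max_iter=50):
    """Return text rows: '#' for points that never escaped, ' ' for the rest."""
    rows = []
    for row in range(height):
        im = -1.0 + 2.0 * row / (height - 1)
        line = ""
        for col in range(width):
            re = -2.0 + 3.0 * col / (width - 1)
            inside = escape_time(complex(re, im), max_iter) == max_iter
            line += "#" if inside else " "
        rows.append(line)
    return rows


if __name__ == "__main__":
    print("\n".join(render()))
```

Every run produces exactly the same picture, because the rule and the viewing window fully determine the result. Nothing is learnt along the way, which is the contrast with a GAN.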

I was fascinated by these, years ago, and here is an example made in 2006:

I found a very helpful and comprehensive resource which lists a wide range of computer Generative Art programs and sources.

Generative Art refers to art created with an autonomous system, which independently determines features of the artwork that would otherwise require decisions made directly by the artist. It is almost always ‘algorithmic’ in that humans write an algorithm or equation to start the process – with unpredictable end results.

GANs take this further, by minimising the impact of human influence via ‘learnt’, machine-determined inputs.

Another algorithmic approach is Agent Based Modelling, which uses Complexity Theory, another area I was tangentially involved in through my business activities in the late 1990s / early 2000s. In particular, US Marine Corps LtGen (Ret) Paul Van Riper applied the lessons of complexity theory to warfare, which directly inspired the ISAAC and EINSTein projects. EINSTein allowed the programming of various war-fighting parameters – hardware, firepower – as well as human elements – no harm to civilians, don’t leave anyone behind.

The scenario then allowed the computer to run thousands of ‘red on blue’ war-games. Interestingly, Van Riper demonstrated that asymmetric warfare needed to be addressed more thoroughly by the Marine Corps (think terrorism). But that is out of scope here 🙂

The algorithms created interesting images.

Again, however, whilst the visualisations were always different on every computer run, over time probability kicked in and the images tended to merge.
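The behaviour described above, where every run differs but the picture settles once many runs are combined, can be sketched with a toy agent-based model. This is emphatically not EINSTein. The movement rule, the firepower and range settings, the grid size and all the function names are hypothetical, chosen only to show the idea.

```python
import random


def run_battle(rng, n_per_side=6, size=12, firepower=0.3, fire_range=2, max_ticks=100):
    """One toy 'red on blue' run; returns (winner, normalised map of visited cells)."""
    agents = []
    for _ in range(n_per_side):
        agents.append({"side": "red", "x": 0, "y": rng.randrange(size), "alive": True})
        agents.append({"side": "blue", "x": size - 1, "y": rng.randrange(size), "alive": True})
    visits = [[0] * size for _ in range(size)]
    for _ in range(max_ticks):
        rng.shuffle(agents)  # random acting order each tick
        for agent in agents:
            if not agent["alive"]:
                continue
            enemies = [e for e in agents if e["alive"] and e["side"] != agent["side"]]
            if not enemies:
                break
            target = min(enemies, key=lambda e: abs(e["x"] - agent["x"]) + abs(e["y"] - agent["y"]))
            dx, dy = target["x"] - agent["x"], target["y"] - agent["y"]
            if abs(dx) + abs(dy) <= fire_range:
                if rng.random() < firepower:  # shot hits with probability 'firepower'
                    target["alive"] = False
            elif rng.random() < 0.2:  # occasional random sidestep
                agent["y"] = min(size - 1, max(0, agent["y"] + rng.choice((-1, 1))))
            elif abs(dx) >= abs(dy):  # otherwise close in on the nearest enemy
                agent["x"] += (dx > 0) - (dx < 0)
            else:
                agent["y"] += (dy > 0) - (dy < 0)
            visits[agent["y"]][agent["x"]] += 1
        agents = [a for a in agents if a["alive"]]
        if len({a["side"] for a in agents}) < 2:
            break
    sides = {a["side"] for a in agents}
    winner = sides.pop() if len(sides) == 1 else "draw"
    total = sum(map(sum, visits)) or 1
    return winner, [[v / total for v in row] for row in visits]


def mean_map(runs, seed, size=12):
    """Average the visit maps of several independent runs."""
    rng = random.Random(seed)
    acc = [[0.0] * size for _ in range(size)]
    for _ in range(runs):
        _, grid = run_battle(rng, size=size)
        for y in range(size):
            for x in range(size):
                acc[y][x] += grid[y][x] / runs
    return acc


def map_distance(a, b):
    """Sum of absolute cell-by-cell differences between two maps."""
    return sum(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb))


if __name__ == "__main__":
    print("single runs: ", map_distance(mean_map(1, seed=1), mean_map(1, seed=2)))
    print("40-run means:", map_distance(mean_map(40, seed=1), mean_map(40, seed=2)))
```

Two single runs usually give visibly different maps, while two averages over many runs should sit much closer together. That is the ‘probability kicking in’ effect in miniature: the rules are fixed by a human, and the randomness washes out in aggregate.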

My point here is that, again, this falls into the Generative category of image creation, rather than AI.

…………………………………………….

Fredlake, Christopher & Wang, Kai. 2008. EINSTein goes to War. Available at https://www.cna.org/CNA_files/PDF/D0018865.A1.pdf (Accessed 23.6.2018).

Karras, Tero; Aila, Timo; Laine, Samuli & Lehtinen, Jaakko (NVIDIA and Aalto University). 2018. Progressive Growing of GANs for Improved Quality, Stability, and Variation. Available at http://research.nvidia.com/publication/2017-10_Progressive-Growing-of (Accessed 22.6.2018).

Komosinski, Maciej & Adamatzky, Andrew (Eds). 2009. Artificial Life Models in Software. Heidelberg: Springer.

Beginner’s Mind & Photography

mickyates Art, Falmouth, MA, Masters, Photography

I first posted a version of this in 2010, on mick’s leadership blog.

It still seems highly appropriate, though this time in a “photographic” context, as I move to the next stage in my MA at Falmouth.

I got a little acquainted with Zen in the 1960s, part of my own quest for “truth”. And whilst many other philosophies and concepts have entered my mind and been very helpful since, I have never forgotten the Zen Parable of “Beginner’s Mind”.

Here is one version:

One day an important Samurai, a man used to being in control at all times, came to visit a famous Zen master.

“I would like you to teach me more about Zen, to help me gain enlightenment and so become a better sword fighter.”

Zen, a philosophy of action, is inextricably linked with the Samurai Way, so the request was not that unusual.

The Zen master smiled and said nothing. Instead he motioned to discuss the matter over a cup of tea. When the ceremony was complete, the tea was served. The master poured the tea. He poured and he poured. The tea flowed over the rim and began to spill over the hand of the samurai – who jumped and dropped the cup.

The samurai was angry. “I came to be taught, and all you do is spill the tea over my hands. Can’t you see the cup is full?”

The master stopped pouring and smiled at his guest. “You are like this tea cup. You are so full of what you know that there is no room to add anything new. Come back to me when the cup is empty. Come back to me with an empty mind, a Beginner’s Mind.”

This rings so true, and is, in many ways, the key to all learning.

It has become clearer to me over the years that the more experience we have, in whatever field of endeavour, the harder it actually is to exercise “Beginner’s Mind”.

We spend a long time learning how to use our cameras, studying how others succeed (or fail), wondering how “The Masters” did it, and seeking advice from others.

We might even develop our own signature style, and try to stick with it.

However, to be truly inventive, just like the samurai, you need to decide what to discard to make room for new things. Sometimes this just happens, when we “see” something new, and want to capture it a different way.

But Louis Pasteur was right when he said that “chance favours the prepared mind”. We need to make space for the inventive, and the conscious act of thinking through what it means to us is a great place to start.

We need to “see” each picture both as a moment reflecting our own knowledge and style, and a new moment, never before seen by anyone, including ourselves.