Tag: audio

Saturday soundings

Black Lives Matter. The money from this month’s kind supporters of Thought Shrapnel has gone directly to the 70+ community bail funds, mutual aid funds, and racial justice organizers listed here.


IBM abandons ‘biased’ facial recognition tech

A 2019 study conducted by the Massachusetts Institute of Technology found that none of the facial recognition tools from Microsoft, Amazon and IBM were 100% accurate when it came to recognising men and women with dark skin.

And a study from the US National Institute of Standards and Technology suggested facial recognition algorithms were far less accurate at identifying African-American and Asian faces compared with Caucasian ones.

Amazon, whose Rekognition software is used by police departments in the US, is one of the biggest players in the field, but there are also a host of smaller players such as Facewatch, which operates in the UK. Clearview AI, which has been told to stop using images from Facebook, Twitter and YouTube, also sells its software to US police forces.

Maria Axente, AI ethics expert at consultancy firm PwC, said facial recognition had demonstrated “significant ethical risks, mainly in enhancing existing bias and discrimination”.

BBC News

Like many newer technologies, facial recognition is already a battleground for people of colour. This is a welcome, if potential cynical move, by IBM who let’s not forget literally provided technology to the Nazis.


How Wikipedia Became a Battleground for Racial Justice

If there is one reason to be optimistic about Wikipedia’s coverage of racial justice, it’s this: The project is by nature open-ended and, well, editable. The spike in volunteer Wikipedia contributions stemming from the George Floyd protests is certainly not neutral, at least to the extent that word means being passive in this moment. Still, Koerner cautioned that any long-term change of focus to knowledge equity was unlikely to be easy for the Wikipedia editing community. “I hope that instead of struggling against it they instead lean into their discomfort,” she said. “When we’re uncomfortable, change happens.”

Stephen Harrison (Slate)

This is a fascinating glimpse into Wikipedia and how the commitment to ‘neutrality’ affects coverage of different types of people and event feeds.


Deeds, not words

Recent events have revealed, again, that the systems we inhabit and use as educators are perfectly designed to get the results they get. The stated desire is there to change the systems we use. Let’s be able to look back to this point in two years and say that we have made a genuine difference.

Nick Dennis

Some great questions here from Nick, some of which are specific to education, whereas others are applicable everywhere.


Sign with hole cut out saying 'NO JUSTICE NO PEACE'

Audio Engineers Built a Shield to Deflect Police Sound Cannons

Since the protests began, demonstrators in multiple cities have reported spotting LRADs, or Long-Range Acoustic Devices, sonic weapons that blast sound waves at crowds over large distances and can cause permanent hearing loss. In response, two audio engineers from New York City have designed and built a shield which they say can block and even partially reflect these harmful sonic blasts back at the police.

Janus Rose (Vice)

For those not familiar with the increasing militarisation of police in the US, this is an interesting read.


CMA to look into Facebook’s purchase of gif search engine

The Competition and Markets Authority (CMA) is inviting comments about Facebook’s purchase of a company that currently provides gif search across many of the social network’s competitors, including Twitter and the messaging service Signal.

[…]

[F]or Facebook, the more compelling reason for the purchase may be the data that Giphy has about communication across the web. Since many services that integrate with the platform not only use it to find gifs, but also leave the original clip hosted on Giphy’s servers, the company receives information such as when a message is sent and received, the IP address of both parties, and details about the platforms they are using.

Alex Hern (The Guardian)

In my 2012 TEDx Talk I discussed the memetic power of gifs. Others might find this news surprising, but I don’t think I would have been surprised even back then that it would be such a hot topic in 2020.

Also by the Hern this week is an article on Twitter’s experiments around getting people to actually read things before they tweet/retweet them. What times we live in.


Human cycles: History as science

To Peter Turchin, who studies population dynamics at the University of Connecticut in Storrs, the appearance of three peaks of political instability at roughly 50-year intervals is not a coincidence. For the past 15 years, Turchin has been taking the mathematical techniques that once allowed him to track predator–prey cycles in forest ecosystems, and applying them to human history. He has analysed historical records on economic activity, demographic trends and outbursts of violence in the United States, and has come to the conclusion that a new wave of internal strife is already on its way1. The peak should occur in about 2020, he says, and will probably be at least as high as the one in around 1970. “I hope it won’t be as bad as 1870,” he adds.

Laura Spinney (Nature)

I’m not sure about this at all, because if you go looking for examples of something to fit your theory, you’ll find it. Especially when your theory is as generic as this one. It seems like a kind of reverse fortune-telling?


Universal Basic Everything

Much of our economies in the west have been built on the idea of unique ideas, or inventions, which are then protected and monetised. It’s a centuries old way of looking at ideas, but today we also recognise that this method of creating and growing markets around IP protected products has created an unsustainable use of the world’s natural resources and generated too much carbon emission and waste.

Open source and creative commons moves us significantly in the right direction. From open sharing of ideas we can start to think of ideas, services, systems, products and activities which might be essential or basic for sustaining life within the ecological ceiling, whilst also re-inforcing social foundations.

TessyBritton

I’m proud to be part of a co-op that focuses on openness of all forms. This article is a great introduction to anyone who wants a new way of looking at our post-COVID future.


World faces worst food crisis for at least 50 years, UN warns

Lockdowns are slowing harvests, while millions of seasonal labourers are unable to work. Food waste has reached damaging levels, with farmers forced to dump perishable produce as the result of supply chain problems, and in the meat industry plants have been forced to close in some countries.

Even before the lockdowns, the global food system was failing in many areas, according to the UN. The report pointed to conflict, natural disasters, the climate crisis, and the arrival of pests and plant and animal plagues as existing problems. East Africa, for instance, is facing the worst swarms of locusts for decades, while heavy rain is hampering relief efforts.

The additional impact of the coronavirus crisis and lockdowns, and the resulting recession, would compound the damage and tip millions into dire hunger, experts warned.

Fiona Harvey (The Guardian)

The knock-on effects of COVID-19 are going to be with us for a long time yet. And these second-order effects will themselves have effects which, with climate change also being in the mix, could lead to mass migrations and conflict by 2025.


Mice on Acid

What exactly a mouse sees when she’s tripping on DOI—whether the plexiglass walls of her cage begin to melt, or whether the wood chips begin to crawl around like caterpillars—is tied up in the private mysteries of what it’s like to be a mouse. We can’t ask her directly, and, even if we did, her answer probably wouldn’t be of much help.

Cody Kommers (Nautilus)

The bit about ‘ego disillusion’ in this article, which is ostensibly about how to get legal hallucinogens to market, is really interesting.


Header image by Dmitry Demidov

Friday facings

This week’s links seem to have a theme about faces and looking at them through screens. I’m not sure what that says about either my network, or my interests, but there we are…

As ever, let me know what resonates with you, and if you have any thoughts on what’s shared below!


The Age of Instagram Face

The human body is an unusual sort of Instagram subject: it can be adjusted, with the right kind of effort, to perform better and better over time. Art directors at magazines have long edited photos of celebrities to better match unrealistic beauty standards; now you can do that to pictures of yourself with just a few taps on your phone.

Jia Tolentino (The New Yorker)

People, especially women, but there’s increasing pressure on young men too, are literally going to see plastic surgeons with ‘Facetuned’ versions of themselves. It’s hard not to think that we’re heading for a kind of dystopia when people want to look like cartoonish versions of themselves.


What Makes A Good Person?

What I learned as a child is that most people don’t even meet the responsibilities of their positions (husband, wife, teacher, boss, politicians, whatever.) A few do their duty, and I honor them for it, because it is rare. But to go beyond that and actually be a man of honor is unbelievably rare.

Ian Welsh

This question, as I’ve been talking with my therapist about, is one I ask myself all the time. Recently, I’ve settled on Marcus Aurelius’ approach: “Waste no more time arguing about what a good man should be. Be one.”


Boredom is but a window to a sunny day beyond the gloom

Boredom can be our way of telling ourselves that we are not spending our time as well as we could, that we should be doing something more enjoyable, more useful, or more fulfilling. From this point of view, boredom is an agent of change and progress, a driver of ambition, shepherding us out into larger, greener pastures.

Neel Burton (Aeon)

As I’ve discussed before, I’m not so sure about the fetishisation of ‘boredom’. It’s good to be creative and let the mind wander. But boredom? Nah. There’s too much interesting stuff out there.


Resting Risk Face

Unlock your devices with a surgical mask that looks just like you.

I don’t usually link to products in this roundup, but I’m not sure this is 100% serious. Good idea, though!


The world’s biggest work-from-home experiment has been triggered by coronavirus

For some employees, like teachers who have conducted classes digitally for weeks, working from home can be a nightmare.
But in other sectors, this unexpected experiment has been so well received that employers are considering adopting it as a more permanent measure. For those who advocate more flexible working options, the past few weeks mark a possible step toward widespread — and long-awaited — reform.

Jessie Yeung (CNN)

Every cloud has a silver lining, I guess? Working from home is great, especially when you have a decent setup.


Setting Up Your Webcam, Lights, and Audio for Remote Work, Podcasting, Videos, and Streaming

Only you really know what level of clarity you want from each piece of your setup. Are you happy with what you have? Please, dear Lord, don’t spend any money. This is intended to be a resource if you want more and don’t know how to do it, not a stress or a judgment to anyone happy with their current setup

And while it’s a lot of fun to have a really high-quality webcam for my remote work, would I have bought it if I didn’t have a more intense need for high quality video for my YouTube stuff? Hell no. Get what you need, in your budget. This is just a resource.

This is a fantastic guide. I bought a great webcam when I saw it drop in price via CamelCamelCamel and bought a decent mic when I recorded the TIDE podcast wiht Dai. It really does make a difference.


Large screen phones: a challenge for UX design (and human hands)

I know it might sound like I have more questions than answers, but it seems to me that we are missing out on a very basic solution for the screen size problem. Manufacturers did so much to increase the screen size, computational power and battery capacity whilst keeping phones thin, that switching the apps navigation to the bottom should have been the automatic response to this new paradigm.

Maria Grilo (Imaginary Cloud)

The struggle is real. I invested in a new phone this week (a OnePlus 7 Pro 5G) and, unlike the phone it replaced from 2017, it’s definitely a hold-with-two-hands device.


Society Desperately Needs An Alternative Web

What has also transpired is a web of unbridled opportunism and exploitation, uncertainty and disparity. We see increasing pockets of silos and echo chambers fueled by anxiety, misplaced trust, and confirmation bias. As the mainstream consumer lays witness to these intentions, we notice a growing marginalization that propels more to unplug from these communities and applications to safeguard their mental health. However, the addiction technology has produced cannot be easily remedied. In the meantime, people continue to suffer.

Hessie Jones (Forbes)

Another call to re-decentralise the web, this time based on arguments about centralised services not being able to handle the scale of abuse and fraudulent activity.


UK Google users could lose EU GDPR data protections

It is understood that Google decided to move its British users out of Irish jurisdiction because it is unclear whether Britain will follow GDPR or adopt other rules that could affect the handling of user data.

If British Google users have their data kept in Ireland, it would be more difficult for British authorities to recover it in criminal investigations.

The recent Cloud Act in the US, however, is expected to make it easier for British authorities to obtain data from US companies. Britain and the US are also on track to negotiate a broader trade agreement.

Samuel Gibbs (The Guardian)

I’m sure this is a business decision as well, but I guess it makes sense given post-Brexit uncertainty about privacy legislation. It’s a shame, though, and a little concerning.


Enjoy this? Sign up for the weekly roundup, become a supporter, or download Thought Shrapnel Vol.1: Personal Productivity!


Header image by Luc van Loon

Friday fizzles

I head off on holiday tomorrow! Before I go, check out these highlights from this week’s reading and research:

  • “Things that were considered worthless are redeemed” (Ira David Socol) — “Empathy plus Making must be what education right now is about. We are at both a point of learning crisis and a point of moral crisis. We see today what happens — in the US, in the UK, in Brasil — when empathy is lost — and it is a frightening sight. We see today what happens — in graduates from our schools who do not know how to navigate their world — when the learning in our schools is irrelevant in content and/or delivery.”
  • Voice assistants are going to make our work lives better—and noisier (Quartz) — “Active noise cancellation and AI-powered sound settings could help to tackle these issues head on (or ear on). As the AI in noise cancellation headphones becomes better and better, we’ll potentially be able to enhance additional layers of desirable audio, while blocking out sounds that distract. Audio will adapt contextually, and we’ll be empowered to fully manage and control our soundscapes.
  • We Aren’t Here to Learn What We Already Know (LA Review of Books) — “A good question, in short, is an honest question, one that, like good theory, dances on the edge of what is knowable, what it is possible to speculate on, what is available to our immediate grasp of what we are reading, or what it is possible to say. A good question, that is, like good theory, might be quite unlovely to read, particularly in its earliest iterations. And sometimes it fails or has to be abandoned.”
  • The runner who makes elaborate artwork with his feet and a map (The Guardian) — “The tracking process is high-tech, but the whole thing starts with just a pen and paper. “When I was a kid everyone thought I’d be an artist when I grew up – I was always drawing things,” he said. He was a particular fan of the Etch-a-Sketch, which has something in common with his current work: both require creating images in an unbroken line.”
  • What I Do When it Feels Like My Work Isn’t Good Enough (James Clear) — “Release the desire to define yourself as good or bad. Release the attachment to any individual outcome. If you haven’t reached a particular point yet, there is no need to judge yourself because of it. You can’t make time go faster and you can’t change the number of repetitions you have put in before today. The only thing you can control is the next repetition.”
  • Online porn and our kids: It’s time for an uncomfortable conversation (The Irish Times) — “Now when we talk about sex, we need to talk about porn, respect, consent, sexuality, body image and boundaries. We don’t need to terrify them into believing watching porn will ruin their lives, destroy their relationships and warp their libidos, maybe, but we do need to talk about it.”
  • Drones will fly for days with new photovoltaic engine (Tech Xplore) — “[T]his finding builds on work… published in 2011, which found that the key to boosting solar cell efficiency was not by absorbing more photons (light) but emitting them. By adding a highly reflective mirror on the back of a photovoltaic cell, they broke efficiency records at the time and have continued to do so with subsequent research.
  • Twitter won’t ruin the world. But constraining democracy would (The Guardian) — “The problems of Twitter mobs and fake news are real. As are the issues raised by populism and anti-migrant hostility. But neither in technology nor in society will we solve any problem by beginning with the thought: “Oh no, we put power into the hands of people.” Retweeting won’t ruin the world. Constraining democracy may well do.
  • The Encryption Debate Is Over – Dead At The Hands Of Facebook (Forbes) — “Facebook’s model entirely bypasses the encryption debate by globalizing the current practice of compromising devices by building those encryption bypasses directly into the communications clients themselves and deploying what amounts to machine-based wiretaps to billions of users at once.”
  • Living in surplus (Seth Godin) — “When you live in surplus, you can choose to produce because of generosity and wonder, not because you’re drowning.”

Image from Dilbert. Shared to make the (hopefully self-evident) counterpoint that not everything of value has an economic value. There’s more to life than accumulation.

There’s no viagra for enlightenment

This quotation from the enigmatic Russell Brand seemed appropriate for the subject of today’s article: the impact of so-called ‘deepfakes’ on everything from porn to politics.

First, what exactly are ‘deepfakes’? Mark Wilson explains in an article for Fast Company:

In early 2018, [an anonymous Reddit user named Deepfakes] uploaded a machine learning model that could swap one person’s face for another face in any video. Within weeks, low-fi celebrity-swapped porn ran rampant across the web. Reddit soon banned Deepfakes, but the technology had already taken root across the web–and sometimes the quality was more convincing. Everyday people showed that they could do a better job adding Princess Leia’s face to The Force Awakens than the Hollywood special effects studio Industrial Light and Magic did. Deepfakes had suddenly made it possible for anyone to master complex machine learning; you just needed the time to collect enough photographs of a person to train the model. You dragged these images into a folder, and the tool handled the convincing forgery from there.

Mark Wilson

As you’d expect, deepfakes bring up huge ethical issues, as Jessica Lindsay reports for Metro. It’s a classic case of our laws not being able to keep up with what’s technologically possible:

With the advent of deepfake porn, the possibilities have expanded even further, with people who have never starred in adult films looking as though they’re doing sexual acts on camera.

Experts have warned that these videos enable all sorts of bad things to happen, from paedophilia to fabricated revenge porn.

[…]

This can be done to make a fake speech to misrepresent a politician’s views, or to create porn videos featuring people who did not star in them.

Jessica Lindsay

It’s not just video, either, with Google’s AI now able to translate speech from one language to another and keep the same voice. Karen Hao embeds examples in an article for MIT Technology Review demonstrating where this is all headed.

The results aren’t perfect, but you can sort of hear how Google’s translator was able to retain the voice and tone of the original speaker. It can do this because it converts audio input directly to audio output without any intermediary steps. In contrast, traditional translational systems convert audio into text, translate the text, and then resynthesize the audio, losing the characteristics of the original voice along the way.

Karen Hao

The impact on democracy could be quite shocking, with the ability to create video and audio that feels real but is actually completely fake.

However, as Mike Caulfield notes, the technology doesn’t even have to be that sophisticated to create something that can be used in a political attack.

There’s a video going around that purportedly shows Nancy Pelosi drunk or unwell, answering a question about Trump in a slow and slurred way. It turns out that it is slowed down, and that the original video shows her quite engaged and articulate.

[…]

In musical production there is a technique called double-tracking, and it’s not a perfect metaphor for what’s going on here but it’s instructive. In double tracking you record one part — a vocal or solo — and then you record that part again, with slight variations in timing and tone. Because the two tracks are close, they are perceived as a single track. Because they are different though, the track is “widened” feeling deeper, richer. The trick is for them to be different enough that it widens the track but similar enough that they blend.

Mike Caulfield

This is where blockchain could actually be a useful technology. Caulfield often talks about the importance of ‘going back to the source’ — in other words, checking the provenance of what it is you’re reading, watching, or listening. There’s potential here for checking that something is actually the original document/video/audio.

Ultimately, however, people believe what they want to believe. If they want to believe Donald Trump is an idiot, they’ll read and share things showing him in a negative light. It doesn’t really matter if it’s true or not.


Also check out:

Noise cancelling for cars is a no-brainer

We’re all familiar with noise cancelling headphones. I’ve got some that I use for transatlantic trips, and they’re great for minimising any repeating background noise.

Twenty years ago, when I was studying A-Level Physics, I was also building a new PC. I realised that, if I placed a microphone inside the computer case, and fed that into the audio input on the soundcard, I could use software to invert the sound wave and thus virtually eliminate fan noise. It worked a treat.

It doesn’t surprise me, therefore, to find that BOSE, best known for its headphones, are offering car manufacturers something similar with “road noise control”:

[youtube https://www.youtube.com/watch?v=SIzkgLdzd9g&w=560&h=315]

With accelerometers, multiple microphones, and algorithms, it’s much more complicated than what I rigged up in my bedroom as a teenager. But the principle remains the same.

Source: The Next Web

Audio Adversarial speech-to-text

I don’t usually go in for detailed technical papers on stuff that’s not directly relevant to what I’m working on, but I made an exception for this. Here’s the abstract:

We construct targeted audio adversarial examples on automatic speech recognition. Given any audio waveform, we can produce another that is over 99.9% similar, but transcribes as any phrase we choose (at a rate of up to 50 characters per second). We apply our white-box iterative optimization-based attack to Mozilla’s implementation DeepSpeech end-to-end, and show it has a 100% success rate. The feasibility of this attack introduce a new domain to study adversarial examples.

In other words, the researchers managed to fool a neural network devoted to speech recognition into transcribing a phrase different to that which was uttered.

So how does it work?

By starting with an arbitrary waveform instead of speech (such as music), we can embed speech into audio that should not be recognized as speech; and by choosing silence as the target, we can hide audio from a speech-to-text system

The authors state that merely changing words so that something different occurs is a standard adverserial attack. But a targeted adverserial attack is different:

Not only are we able to construct adversarial examples converting a person saying one phrase to that of them saying a different phrase, we are also able to begin with arbitrary non-speech audio sample and make that recognize as any target phrase.

This kind of stuff is possible due to open source projects, in particular Mozilla Common Voice. Great stuff.
 

Source: Arxiv

Get a Thought Shrapnel digest in your inbox every Sunday (free!)
Holler Box