Posts tagged text

Better Language Models and Their Implications

2019-02-20 gpt2, ML, AI, text, text-generation, generative, OpenAI, 2019

GPT-2 displays a broad set of capabilities, including the ability to generate conditional synthetic text samples of unprecedented quality, where we prime the model with an input and have it generate a lengthy continuation. In addition, GPT-2 outperforms other language models trained on specific domains (like Wikipedia, news, or books) without needing to use these domain-specific training datasets. On language tasks like question answering, reading comprehension, summarization, and translation, GPT-2 begins to learn these tasks from the raw text, using no task-specific training data. While scores on these downstream tasks are far from state-of-the-art, they suggest that the tasks can benefit from unsupervised techniques, given sufficient (unlabeled) data and compute.

via https://blog.openai.com/better-language-models/

I Can Text You A Pile of Poo, But I Can’t Write My Name - Aditya Mukerjee

2016-12-06 culture, unicode, text, writing, language, emoji

The evolution of emoji is impressive and fascinating, but it makes for an uncomfortable contrast when other pictorial writing systems – the most commonly-used writing systems on the planet – are on the chopping block. We have an unambiguous, cross-platform way to represent “PILE OF POO” (💩), while we’re still debating which of the 1.2 billion native Chinese speakers deserve to spell their own names correctly.

via https://modelviewculture.com/pieces/i-can-text-you-a-pile-of-poo-but-i-cant-write-my-name

Typewriter poems by Erik Blagsvedt, 2016.

2016-11-30 retro, text, type, poetry, Erik Blagsvedt, 2016

L1027777 (via http://flic.kr/p/Nsg1v5 )

2016-11-05 jp, tokyo, decay, text, texture, f24, leicam9, ¹⁄₃₀sec, leicasummiluxm35mmf14asph, iso640, ¹⁄₃₀secat

L1027777 (via http://flic.kr/p/Nsg1v5 )

L1027543 (via http://flic.kr/p/NKc2iV )

2016-11-04 guanyin, jp, kaga, kannon, nataderatemple, fadedlettering, text, type, 那谷寺, f24, leicam9, ¹⁄₁₂₅sec

L1027543 (via http://flic.kr/p/NKc2iV )

How to Run Text Summarization with TensorFlow

2016-10-23 Medium, text, text summarisation, machine learning, tensorflow

Text summarization problem has many useful applications. If you run a website, you can create titles and short summaries for user generated content. If you want to read a lot of articles and don’t have time to do that, your virtual assistant can summarize main points from these articles for you. It is not an easy problem to solve. There are multiple approaches, including various supervised and unsupervised algorithms. Some algorithms rank the importance of sentences within the text and then construct a summary out of important sentences, others are end-to-end generative models. End-to-end machine learning algorithms are interesting to try. After all, end-to-end algorithms demonstrate good results in other areas, like image recognition, speech recognition, language translation, and even question-answering.

via https://medium.com/@surmenok/how-to-run-text-summarization-with-tensorflow-d4472587602d

Purposefully illegible text from 12th & 21st centuries: A is unreadable by humans; B unreadable by digital scanners

2016-08-19 legiblity, text, type, writing, obscurity, obsfucation, 1100s, 2000s

Purposefully illegible text from 12th & 21st centuries: A is unreadable by humans; B unreadable by digital scanners

2,000-Year-Old Scrolls Inscribed With Ancient Curses Uncovered in Serbia

2016-08-12 archeology, religion, text, languages, curses, gods, demons, Baal, Yahweh, Thobarabau, Seneseilam, S

Archaeologists working in Serbia have discovered tiny parchments of gold and silver inscribed with what appears to be a series of ancient curses. The curse tablets were found alongside human skeletons at an excavation site at the foot of a coal-fired power station in Kostolac in northeastern Serbia. Archaeologists led by Miomir Korać are currently scouring the area in preparation for further construction at the site, which was once home to the ancient Roman city of Viminacium. One of the newly discovered scrolls contains text written in ancient Aramaic, and not Greek. That presents a mystery to the scientists, but it’s also an important clue. The researchers have identified several demons associated with the territory of what is today Syria, including Baal, Yahweh, Thobarabau, Seneseilam, and Sesengenfaranges. Invoking the powers of both Baal and Yahweh on a single tablet is unprecedented.

via http://www.gizmodo.co.uk/2016/08/2000-year-old-scrolls-inscribed-with-ancient-curses-uncovered-in-serbia/

NEVIR (via http://flic.kr/p/y73FFP )

2015-09-01 light, brussels, text, bruxelles, be, type, brussel, bruxxel, f24, iso160, brüsel, leicasummiluxm35m

NEVIR (via http://flic.kr/p/y73FFP )

a Guest + a Host = Ghost

2015-08-30 Duchamp, host, guest, ghost, hauntology, text, FR

a Guest + a Host = Ghost

(via http://flic.kr/p/ttGi2j )

2015-05-24 uk, london, sign, text, type, iso160, f67, lookagain, leicasummiluxm35mmf14asph, leicam9, ¹⁄₄₅sec, ¹

(via http://flic.kr/p/ttGi2j )

Editing Finnegans Wake

2015-01-03 james joyce, joyce, editorial, editing, text, writing, markup, marginalia

Editing Finnegans Wake

Search, compile, publish.

2014-02-19 text, internet, digital, post-digital, art, artist books, library of the printed web

Looking through the works, you see artists sifting through enormous accumulations of images and texts. They do it in various ways—hunting, grabbing, compiling, publishing. They enact a kind of performance with the data, between the web and the printed page, negotiating vast piles of existing material. Almost all of the artists here use the search engine, in one form or another, for navigation and discovery.

http://soulellis.com/2013/05/search-compile-publish/

Groningen Meaning Bank

2012-11-30 AI, NLP, language, semantics, syntax, text, machine learning

The Groningen Meaning Bank consists of public domain English texts with corresponding syntactic and semantic representations.

http://gmb.let.rug.nl/documentation.php