lexichron is for tracking conceptual shifts across years and decades. It efficiently prepares and filters extremely large corpora, trains models on them, and analyzes the results.
Although it can be tuned for personal computers, lexichron is designed for parallelization on HPC clusters and cloud platforms.
lexichron implements two data-prep pipelines:
- Google Ngrams: download and filter 1–5 grams in 8 languages (see the filtering sketch below)
- Mark Davies' corpora: process datasets from English-Corpora.org that carry year and genre metadata (e.g., COHA, COCA); this requires a license and access to the corpus files
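To make the Ngrams step concrete, here is a minimal sketch of the kind of filtering that pipeline performs, written against the tab-separated layout of the 2012 (v2) shards (ngram, year, match_count, volume_count). The function name, thresholds, and whitelist are illustrative, not lexichron's actual API.

```python
# Illustrative only: a minimal filter over one Google Ngrams (v2) shard.
# min_year, min_count, and the whitelist are placeholders.
import gzip

def filter_ngram_shard(path, whitelist, min_year=1900, min_count=40):
    """Yield (ngram, year, match_count) rows that pass all filters."""
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        for line in fh:
            ngram, year, match_count, _volumes = line.rstrip("\n").split("\t")
            if int(year) < min_year or int(match_count) < min_count:
                continue
            # Keep the ngram only if every token is whitelisted.
            if all(tok.lower() in whitelist for tok in ngram.split()):
                yield ngram, int(year), int(match_count)
```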
Both data-prep pipelines feed into a single word2vec training module (gensim's implementation) that trains, normalizes, and aligns yearly models to capture semantic change.
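Below is a minimal sketch of that train/normalize/align loop, assuming gensim's Word2Vec for training and orthogonal Procrustes alignment (the standard approach of Hamilton et al., 2016); the function names and hyperparameters are illustrative, and lexichron's actual module may differ.

```python
# Sketch of train -> normalize -> align; names and settings are illustrative.
import numpy as np
from gensim.models import Word2Vec
from scipy.linalg import orthogonal_procrustes

def train_year(sentences):
    # sg=1 selects skip-gram with negative sampling (SGNS); sg=0 is CBOW.
    return Word2Vec(sentences, vector_size=300, window=5,
                    sg=1, min_count=10, workers=8)

def normalize(model):
    """Length-normalize vectors so cosine similarity is a plain dot product."""
    model.wv.vectors /= np.linalg.norm(model.wv.vectors, axis=1, keepdims=True)

def align_to(base, other):
    """Rotate `other`'s vectors into `base`'s space over the shared vocab."""
    shared = [w for w in other.wv.index_to_key if w in base.wv]
    a = np.stack([other.wv[w] for w in shared])
    b = np.stack([base.wv[w] for w in shared])
    rotation, _ = orthogonal_procrustes(a, b)  # orthogonal R minimizing ||aR - b||
    other.wv.vectors = other.wv.vectors @ rotation
```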
Key features:
- Tunable preprocessing, including case normalization, lemmatization, stop-word removal, and spell-checking (sketched after this list)
- Vocabulary whitelisting for efficient filtering
- Bigram preservation, retaining semantically interesting word pairs
- RocksDB databases for fast queries
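As a rough illustration of how the preprocessing, whitelisting, and bigram options interact (the stop words, whitelist, and bigram list below are placeholders, not lexichron's built-in resources):

```python
# Toy preprocessing pass; all word lists here are placeholders.
STOP_WORDS = {"the", "a", "an", "of", "and"}
KEPT_BIGRAMS = {("climate", "change"), ("civil", "war")}

def preprocess(tokens, whitelist, lowercase=True):
    out, i = [], 0
    while i < len(tokens):
        pair = tuple(t.lower() for t in tokens[i:i + 2])
        if pair in KEPT_BIGRAMS:
            out.append("_".join(pair))  # preserve the bigram as one token
            i += 2
            continue
        tok = tokens[i].lower() if lowercase else tokens[i]
        if tok not in STOP_WORDS and tok in whitelist:
            out.append(tok)
        i += 1
    return out
```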
Purpose-built for HPC clusters and cloud computing. Pipelines that take weeks on a laptop run in minutes or hours on a cluster. (Fewer cores? It'll work, just slower.)
lexichron truly shines with 30+ CPUs, 80+ GB of RAM, and fast NVMe SSD storage.
Training and evaluation pipelines:
- Hyperparameter tuning: architecture (SGNS vs. CBOW), vector size, context window, weighting strategy, etc.
- Model evaluation on similarity and analogy benchmarks (see the sketch after this list)
- Visualizations comparing model quality
- Regressions quantifying each hyperparameter's impact
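For the benchmark step, gensim ships evaluation helpers and sample datasets that can be called directly on a trained model; the bare calls look like this (how lexichron wraps them is not shown, and `model` is assumed to be an already-trained Word2Vec):

```python
# Bare-bones benchmark calls; wordsim353.tsv and questions-words.txt
# ship with gensim's test data.
from gensim.test.utils import datapath

pearson, spearman, oov_ratio = model.wv.evaluate_word_pairs(
    datapath("wordsim353.tsv"))                  # similarity benchmark
analogy_score, sections = model.wv.evaluate_word_analogies(
    datapath("questions-words.txt"))             # analogy benchmark
print(f"WordSim-353 Spearman: {spearman[0]:.3f}, "
      f"analogy accuracy: {analogy_score:.3f}")
```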
Analysis resources:
- Easily compute cosine similarities and plot temporal trends
- Track and visualize semantic drift for individual words (drift and WEAT helpers are sketched after this list)
- Compute WEATs (Word Embedding Association Tests)
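As a hypothetical sketch of the drift and WEAT computations over aligned yearly models (the function names and data structures are illustrative, not lexichron's API):

```python
# Hypothetical helpers over aligned yearly gensim models.
import numpy as np

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def drift_series(models_by_year, word, base_year):
    """Year -> cosine(word@year, word@base_year); lower means more drift."""
    base = models_by_year[base_year].wv[word]
    return {year: cosine(m.wv[word], base)
            for year, m in sorted(models_by_year.items()) if word in m.wv}

def weat(wv, targets_x, targets_y, attrs_a, attrs_b):
    """WEAT test statistic (Caliskan et al., 2017)."""
    def assoc(w):  # s(w, A, B): mean cosine with A minus mean cosine with B
        return (np.mean([cosine(wv[w], wv[a]) for a in attrs_a])
                - np.mean([cosine(wv[w], wv[b]) for b in attrs_b]))
    return sum(assoc(x) for x in targets_x) - sum(assoc(y) for y in targets_y)
```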
lexichron is a work-in-progress but stable for many use cases. Check out the README and the included Jupyter notebooks for more info.
I welcome your feedback!