Edgar
@theotheredgar.bsky.social
1.7K followers 90 following 7 posts
#rstats #visualizations #datascience
Posts Media Videos Starter Packs
Reposted by Edgar
posit.co
Posit @posit.co · Jul 21
Announcing streamlined MLOps with Orbital on Databricks 🛰️🧱

Orbital translates #ScikitLearn #Python or #tidymodels #RStats to native #SQL for direct database model execution.

@theotheredgar.bsky.social's post uses Databricks as an integrated environment.

Learn more: posit.co/blog/databri...
A graphic illustrating data science and big data technologies. On the left, stacked vertically, are the logos for R and Python. In the center, also stacked vertically, are the Orbital logo (featuring a satellite) and the Scala logo (a blue serpent). On the right, stacked vertically, are the Databricks logo and a generic database cylinder icon. The background is a light blue with a subtle, dark blue dot pattern at the bottom.
theotheredgar.bsky.social
Hi! That's is currently in the dev version of the package, we're going to use ellmer/chatlas as the way to get an external integration with LLMs: mlverse.github.io/mall/
mall
mlverse.github.io
Reposted by Edgar
rachaeldempsey.bsky.social
the pins 📌 package gets a lot of love in the chat at community events - and we're excited to share a workflow today!

this one was asked about specifically at a Data Science Hangout!

....we're talking about pins + Databricks with @theotheredgar.bsky.social at 11am ET!

youtu.be/ab4CIlafsbo?...
APRIL 30th: Easier data and asset sharing across projects and teams with {pins} and Databricks
YouTube video by Posit PBC
youtu.be
Reposted by Edgar
camlivio.bsky.social
Shoutout to @ivelasq3.bsky.social and @posit.co for the opportunity to write a blog post about how I'm using `library(mall)` and integrating large language models into our energy security research! #textdata #LLM #energy #energysecurity #socialscience #datascience #NLProc
posit.co
Posit @posit.co · Mar 25
Discover how the mall package simplifies LLM integration in R!

In @camlivio.bsky.social's guest post, she walks through how she uses mall to summarize dense PDF reports, extract key entities, and visualize the frequency of relevant terms, all with #RStats.

Read it here: posit.co/blog/mall-ai...
Illustration demonstrating how an AI-powered system gathers data from various documents and compiles it into a single, organized format, using the mall hex to represent the AI processing.
Reposted by Edgar
posit.co
Posit @posit.co · Mar 25
Discover how the mall package simplifies LLM integration in R!

In @camlivio.bsky.social's guest post, she walks through how she uses mall to summarize dense PDF reports, extract key entities, and visualize the frequency of relevant terms, all with #RStats.

Read it here: posit.co/blog/mall-ai...
Illustration demonstrating how an AI-powered system gathers data from various documents and compiles it into a single, organized format, using the mall hex to represent the AI processing.
Reposted by Edgar
posit.co
Posit @posit.co · Mar 10
We're excited to announce a new release of odbc!

This release includes a new hex logo (thanks, @theotheredgar.bsky.social!), viewer-based credentials on Posit Connect for `databricks()` and `snowflake()`, and more.

Read more in the release notes: odbc.r-dbi.org/news/index.h...

#RStats
the odbc hex logo
Reposted by Edgar
simonpcouch.com
odbc 1.6.0 is now on #rstats CRAN! Includes a new helper for Redshift, a hex sticker (finally!), and many QOL improvements for Databricks, Snowflake, MSSQL, etc.

Read more: odbc.r-dbi.org/news/index.h...
A hexagonal sticker with an all black background and "odbc" written in layered shades of blue.
Reposted by Edgar
christiannolan.bsky.social
The 2nd wave of uv is here — developers of other systems building on top of uv to make new and effortless workflows that use Python virtual environments behind the scenes 🤩
posit.co
Posit @posit.co · Mar 3
We are delighted to announce Reticulate 1.41!

You can now simply declare your dependencies using `py_require()`, and Reticulate will handle the rest. This seamless experience is powered by uv, an extremely fast #Python package manager written in Rust.

Learn more: posit.co/blog/reticul...

#RStats
The reticulate hex plus the uv logo
Reposted by Edgar
ramikrispin.bsky.social
(1/3) Every week, I review an open-source project in my newsletter. This week, the focus is on the lang project.

The lang library enables the translation of any function's documentation to a different language by using LLM on the fly.

github.com/mlverse/lang

#RStats #LLM #AI
GitHub - mlverse/lang: Uses LLMs to translate R help docs on the fly
Uses LLMs to translate R help docs on the fly. Contribute to mlverse/lang development by creating an account on GitHub.
github.com
Reposted by Edgar
posit.co
Posit @posit.co · Feb 6
Introducing the {lang} package by @theotheredgar.bsky.social for translating R help using your local LLM! Lea la ayuda en #RStats en su propio idioma!

lang helps you translate your documentation and include it as part of your package.

Check it out here! github.com/mlverse/lang
mlverse package with an RStudio help window in Spanish
Reposted by Edgar
posit.co
Posit @posit.co · Dec 18
Pin your data and model objects to Databricks Volumes in #RStats!

With pins, you can store an object on a board, like Dropbox, Posit Connect, or Amazon S3.

We have merged support for pinning objects to Databricks Volumes with the `board_databricks()` function!

Read more: posit.co/blog/pins-in...
Pins in Databricks - Posit
The pins R package now has support for the `board_databricks()` function, which allows you to access and store pins in Databrick’s Volumes from your R script.
posit.co
Reposted by Edgar
juliasilge.com
The recent work that @theotheredgar.bsky.social did in the pins #rstats package to support Databricks Volumes is QUITE NICE for folks who need more flexibility in how they store objects/files there! 📌

Read a bit about it here:
posit.co/blog/pins-in...
Pins in Databricks - Posit
The pins R package now has support for the `board_databricks()` function, which allows you to access and store pins in Databrick’s Volumes from your R script.
posit.co
theotheredgar.bsky.social
📦 Lea la ayuda en R en su propio idioma! Es un gusto de introducir a {lang}, traduce la ayuda 'en vivo' utilizando Ollama y lo muestra en la misma ventana de 'help' en su entorno de desarrollo github.com/mlverse/lang #rstats #ollama #llm
GitHub - mlverse/lang: Uses LLMs to translate R help docs on the fly
Uses LLMs to translate R help docs on the fly. Contribute to mlverse/lang development by creating an account on GitHub.
github.com
theotheredgar.bsky.social
Dev 📦 alert! {lang} translates R help on-the-fly using your local LLM! It also overrides the `?` so you can easily access the translated docs and have them displayed on your IDE's help pane github.com/mlverse/lang #rstats #llm #ollama
GitHub - mlverse/lang: Uses LLMs to translate R help docs on the fly
Uses LLMs to translate R help docs on the fly. Contribute to mlverse/lang development by creating an account on GitHub.
github.com
theotheredgar.bsky.social
Hi, as long as 'mall' is imported, the new Polars DF should automatically have `llm` namespace

import polars as pl
import mall
f = open("test.csv", "w")
f.write("text\n\"I am happy\"\n")
f.close()
df = pl.read_csv("test.csv")
df.llm.sentiment("text")
Reposted by Edgar
posit.co
Posit @posit.co · Nov 21
Introducing the mall package for running multiple LLM predictions against a data frame in #RStats or #Python!

mall is inspired by the SQL AI functions offered by vendors such as Databricks and Snowflake.

Learn more in this blog post by @theotheredgar.bsky.social: blogs.rstudio.com/ai/posts/202...
Posit AI Blog: Introducing mall for R...and Python
We are proud to introduce the {mall}. With {mall}, you can use a local LLM to run NLP operations across a data frame. (sentiment, summarization, translation, etc). {mall} has been simultaneusly rele...
blogs.rstudio.com
theotheredgar.bsky.social
📦 In today's, cool things {mall} can do: You can set the values returned per sentiment. Saving you the need for the extra step, and making the code more concise #rstats #pydata #polars #ollama
A screenshot of two examples, one from R and the other from Python. It shows how you can set the 1 and 0 to represent positive and negative.
theotheredgar.bsky.social
New 📦 alert! {mall} is out now for both #rstats...and #python! The package uses #llm 's to run NLP operations recursively over a data frame (sentiment, summarization, translation, etc). For Python it's a #polars extension. Both use #ollama to interact with the LLM. mlverse.github.io/mall/
mall's homepage screenshot Screenshot of the results from using mall with Polars Results of using mall with R
Reposted by Edgar
ivelasq3.bsky.social
Are you a Spark user who prefers writing in R? User-defined functions with sparklyr might be what you need ✨

With `spark_apply()`, you can write functions in #RStats and use them in #Spark queries.

Learn more in the blog post: posit.co/blog/databri...
Parallelize R code using user-defined functions (UDFs) in sparklyr - Posit
The sparklyr package enables writing user-defined functions (UDFs) in R, which allow you to leverage Spark for efficient big data processing.
posit.co
Reposted by Edgar
topepo.bsky.social
A new version of the #rstats probably package is on CRAN. A minor update with a bug fix and under-the-hood changes for the upcoming tune version.

But there’s finally a hex logo (thanks to Edgar Ruiz) so we have that going for us. Which is nice.

probably.tidymodels.org