Data Science Renee
@becomingdatasci.bsky.social
Author of SQL for Data Scientists (Wiley), Senior Director of Data Science at HelioCampus, creator of Becoming a Data Scientist podcast
She/her
https://sqlfordatascientists.com
Pinned
SQL for Data Scientists | author's book companion website
sqlfordatascientists.com
If you know anyone who's interested in starting a career in data, check out my book SQL for Data Scientists!
sqlfordatascientists.com
Heads up that Manning is having a half-off books sale. They have a lot of good data science titles. I was interviewed for this one 😊
www.manning.com/books/build-...
Build a Career in Data Science - Emily Robinson and Jacqueline Nolis
A guide to landing your first data science job and developing into a valued senior employee. Learn how to craft an amazing resume and ace your interviews.
www.manning.com
November 1, 2025 at 4:00 PM
If you ever see me post non-data-related stuff here, it's because I forgot that I switched accounts 😅 If you're interested in following my personal account, it's @paix120.bsky.social
October 31, 2025 at 5:05 PM
Reposted by Data Science Renee
🤔 @becomingdatasci.bsky.social new way to learn SQL just dropped
This paper introduces the LEGO Database, a large natural dataset that can be used to teach Structured Query Language (SQL) and relational database concepts.
ERIC - EJ1468081 - Using LEGO® Brick Data to Teach SQL and Relational Database Concepts, Information Systems Education Journal, 2025
This paper introduces the LEGO® Database, a large natural dataset that can be used to teach Structured Query Language (SQL) and relational database concepts. This dataset is well-suited for introductory and advanced database assignments and end-of-semester group projects. The data is freely available from Kaggle.com and contains eight tables with 633,250 rows of data on 11,673 LEGO® sets sold between 1950 and 2017. As a guiding example, I introduce an example group project assignment designed to provide students hands-on experience with database management and SQL queries. I also discuss tips, suggestions, and lessons learned from using the data for group projects over the past five years. While LEGO® bricks have been widely used in educational settings, including college and computer classrooms, this is the first work to discuss the use of LEGO® data in a college database course.
eric.ed.gov
October 19, 2025 at 3:26 AM
Looking forward to reading this
WIRED's AI package launches today: 17 stories about how AI is changing us and the world we live in, from AI weaponry to what happens when the bubble bursts
Read them all here @wired.com:
www.wired.com/ai-issue/
October 27, 2025 at 1:30 PM
Reposted by Data Science Renee
October 16, 2025 at 10:06 PM
Reposted by Data Science Renee
Apparently there are a bunch of new people coming over from Twitter... Drop your data science starter packs in the replies for people to follow!
October 16, 2025 at 3:23 AM
Reposted by Data Science Renee
Some of my favs: bsky.app/profile/did:...
October 16, 2025 at 11:44 AM
Reposted by Data Science Renee
October 16, 2025 at 11:47 AM
Reposted by Data Science Renee
Two "data people" ones:
1: bsky.app/starter-pack...
2: bsky.app/starter-pack...
and this is where people can search starter packs (my go-to resource for those new to Bluesky!): blueskydirectory.com/starter-packs
October 16, 2025 at 12:16 PM
Reposted by Data Science Renee
I would love to see the AI skill training that Accenture used. Anyone interested in leaking it to me?
www.cnbc.com/2025/09/26/a...
Accenture plans on 'exiting' staff who can't be reskilled on AI amid restructuring strategy
Accenture CEO Julie Sweet said as advanced AI becomes core to the company's strategy, employees are expected to "retrain and retool" at scale.
www.cnbc.com
October 2, 2025 at 2:03 AM
No.
October 1, 2025 at 8:06 PM
Who are the best voices in the "future, given AI" analysis space?
Like, writing about a potential point at which there isn't enough publicly-available human-generated raw content to train them on.
Or where there aren't enough experienced staff for a role because AI impacted the workforce pipeline
September 28, 2025 at 4:50 PM
Reposted by Data Science Renee
Lazy post since I haven't yet searched for an answer:
How would you determine if you're good at prompt engineering? Are there tests? Are there categories, like you can hone skills for a specific type of Gen AI use case, or specific models?
September 13, 2025 at 4:38 PM
Reposted by Data Science Renee
anybody got a good <1B?
Can you give examples of some of these models? I want to try them out
September 13, 2025 at 4:36 PM
Reposted by Data Science Renee
I did the first of two of these yesterday, and more than 50 early career scientists from at least 5 continents joined us! The second is tomorrow afternoon, and there's still time to sign up
ICYMI, later this week (in 2 different time slots) I am offering a free online crash-course in how to write and publish a scientific paper. This course is ideal for grad students and early career researchers interested in writing their first paper.
Info and signup below:
🧪🦑🌎
I’m offering a free online crash course in scientific writing and publishing. Here’s how to join!
The world of scientific writing and publishing is complex and confusing, and it can be hard for early career scientists to master. But don’t worry! I am an experienced and award-winning scien…
www.southernfriedscience.com
September 11, 2025 at 2:03 PM
Warning to those who work with people's private data that this is the stuff of nightmares
Gift link
DOGE put the personal information of hundreds of millions of Americans at risk by uploading Social Security data to a vulnerable cloud server, a whistle-blower complaint said (gift link)
www.nytimes.com/2025/08/26/u...
DOGE Put Critical Social Security Data at Risk, Whistle-Blower Says
www.nytimes.com
August 26, 2025 at 6:56 PM
Reposted by Data Science Renee
In a first, Google has released data on how much energy an AI prompt uses www.technologyreview.com/2025/08/21/1...
In a first, Google has released data on how much energy an AI prompt uses
It’s the most transparent estimate yet from one of the big AI companies, and a long-awaited peek behind the curtain for researchers.
www.technologyreview.com
August 21, 2025 at 1:30 PM
Reposted by Data Science Renee
many benchmarks used to measure AI capabilities are, I think, contrived and lenient. here's a good real-life study, on whether AI can do your (US) tax returns; a domain with plentiful training data and documentation. the result: the best model only got 33% of returns correct arxiv.org/pdf/2507.16126
August 19, 2025 at 4:02 PM
Same with Data Science, and understanding the aspects of the work that go beyond training models.
If you understand what software engineering as a career actually entails (lots of people and organizational problems, understanding legacy code and tradeoffs) you are at a career advantage over those who understand the job as just producing code.
Those jobs could be replaced. But that ain’t the job
August 12, 2025 at 6:28 PM
Reposted by Data Science Renee
I agree
The burden of proof needs to be on the people arguing that AI *is* useful for any given task. The counting letters and silly maps make the point that we can’t give these systems a blanket presumption of competence, and that’s important.
August 9, 2025 at 5:23 AM
Bookshop apparently has free shipping right now, a good time to pick up my book and others! 😊
SQL for Data Scientists
bookshop.org/p/books/sql-...
SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis
A Beginner's Guide for Building Datasets for Analysis
bookshop.org
July 9, 2025 at 7:02 PM
Reposted by Data Science Renee
The Senate parliamentarian is asking the Senate Commerce Committee to rework its 10-year moratorium on enforcing state AI laws.
Parliamentarian requests AI moratorium rewrite
At issue is the scope of funding that will be conditioned on states complying with a 10-year pause on enforcing their AI laws.
www.politico.com
June 26, 2025 at 7:21 PM