Neelesh Salian
@neelesh.bsky.social
Engineer. Data product builder.
Some open-source Spark drama if you want some
lists.apache.org/thread/0558x...
lists.apache.org
March 15, 2025 at 8:48 PM
Executives: "We need Gen AI"
Engineers:
March 2, 2025 at 6:12 AM
Mentally I’m still at JDK 8 and 11. 23!
Image borrowed from LinkedIn
February 13, 2025 at 5:51 PM
What would it take for people on LinkedIn to stop posting about Deepseek?
I think we should shut it down for a few days and see if people care about it.
Most are chasing the trend and spewing random posts.
February 4, 2025 at 1:07 AM
How would this integrate? Perhaps enhancing the Dynamic tables?
BYOC streaming is so hot right now. Rumors swirling around a potential $1.5B acquisition of Redpanda by Snowflake.
January 30, 2025 at 7:15 PM
When they invite the Engineer to present to the C Suite
January 30, 2025 at 4:01 PM
It’s telling how AI as an industry is progressing when you see the people freaking out about Deepseek are mostly VCs.
January 27, 2025 at 12:54 AM
Deepseek is everywhere.
January 26, 2025 at 9:41 PM
Found this comprehensive guide to AI and how to make sense of the various concepts.
www.leverage.to/learn/dev/ai...
The 6 AI Engineering Patterns In 2025
Learning how to build AI powered products
www.leverage.to
January 25, 2025 at 10:49 PM
Tomorrow is restart your Apple TV+ subscription for Severance Season 2 day.
Also, maybe catch Silo.
And rewatch Slow Horses.
It’s going to be a long work day, sigh.
January 17, 2025 at 4:16 AM
Onehouse with their Compute Runtime across table formats. It seems like a good direction but I’m curious if those benchmarks are accurate. 30x query perf, 10x faster writes, I wish there was a demo or something to showcase these.
www.onehouse.ai/blog/introdu...
Introducing Onehouse Compute Runtime to Accelerate Lakehouse Workloads Across All Engines
Learn how Onehouse Compute Runtime delivers an independent, universal foundation to accelerate queries 2-30x and reduce cloud infrastructure bills 20-80% with adaptive workload optimizations, serverle...
www.onehouse.ai
January 16, 2025 at 5:39 PM
Tobiko acquired Quary: tobikodata.com/tobiko-acqui...
The transformation game is back on again
Tobiko - Tobiko Acquires Quary: Evolving the Gold Standard for Data Engineers
tobikodata.com
January 15, 2025 at 9:50 PM
To whomsoever it may concern: Acquisitions are great. Integrations are hard. Good luck.
January 14, 2025 at 5:15 PM
Worth reading this blog about Arrow: arrow.apache.org/blog/2025/01...
How the Apache Arrow Format Accelerates Query Result Transfer
Arrow speeds up query result transfer by slashing (de)serialization overheads. We outline five key attributes of the Arrow format that enable this.
arrow.apache.org
January 13, 2025 at 6:06 PM
How AI is messing up things across the board
Also, some things could be a Slack message rather than an email
January 11, 2025 at 4:21 PM
Anytime I open LinkedIn it’s either
- Random person with a take on AI, usually hyping
- Someone talking about random topics like why is Test Driven Development
- At least one reference to Clean Code
I can’t imagine how that platform is useful to anyone who wants to get an idea of the industry.
January 11, 2025 at 3:32 AM
Most companies need a better backend API than anything close to AI or LLMs. Just do the basic thing right first. You don’t need to jump to the fancy objects.
January 10, 2025 at 4:28 AM
Somebody said it
Here is why I consider AI agents nothing more than software. I'm talking specifically about the pieces of software that do things in a pipeline to complete a task. Let's take booking a flight to Vancouver as an example task.
January 10, 2025 at 4:02 AM
What the Modern Data Stack felt like.
January 9, 2025 at 5:14 AM
2025 is the year of realizing that hourly batch or micro batch is sufficient over continuous processing. The trick is to ask what your stakeholders actually expect to happen and then build accordingly.
January 2, 2025 at 6:29 PM
Happy New Notebook Day. My first MUJI notebook. The pen has been great! Got these on my trip to Seoul.
January 1, 2025 at 6:30 PM
The Data Engineer vs the Data itself
December 30, 2024 at 8:00 AM
My 2025 New Year’s Resolution is to tell everyone in the Data and AI space that you can’t build good AI without quality data. Focus on fundamentals rather than fancy.
December 29, 2024 at 5:46 AM
Looks like PySpark, Ray, and Dask have a competitor - Bodo. Has anyone tried it yet? This is the claimed execution time.
github.com/bodo-ai/Bodo
December 16, 2024 at 7:18 PM
I'd be surprised if it gets practical usage in the next 10 years.
Is quantum computing real? Or is that a fantasy future tech of my youth
December 12, 2024 at 5:53 AM