Scholar

Flora D. Salim

H-index: 33
Computer science 53%
Engineering 39%
florasalim.bsky.social
Finally, we compared Bisecle to frontier models like GPT-4o and Gemini 2.5. Note that we can only use APIs to test them. We show that these frontier LLMs still struggle with temporal reasoning and dealing with non-stationary, evolving video tasks. In some tasks, Bisecle show superior performance
Bisecle: Binding and Separation in Continual Learning for Video Language Understanding
Frontier vision-language models (VLMs) have made remarkable improvements in video understanding tasks. However, real-world videos typically exist as continuously evolving data streams (e.g., dynamic s...
arxiv.org
florasalim.bsky.social
- Bisecle exhibits remarkable resistance to forgetting
- Bisecle is compatible with LLMs from 1B to 13B, introducing only a small number of additional parameters and computational cost.
- Bisecle can achieve superior performance even in low-resource settings.
florasalim.bsky.social
The results show that:
- Bisecle establishes a new SOTA results surpassing others in both accuracy (+15.79%) and forgetting reduction (8.49% lower Forgetting rate).
- Our method Bisecle consistently outperforms others, indicating strong robustness even when training data is limited.
florasalim.bsky.social
The two components of Bisecle with complementary angles:
- multi-directional supervision mechanism improves knowledge preservation.
- contrastive prompt learning scheme is designed to isolate task-specific knowledge to facilitate efficient memory storage, and to explicitly mitigate update conflict.
florasalim.bsky.social
Excited to share that Bisecle is accepted at #NeurIPS2025.
🧠 Bisecle: Binding and Separation in Continual Learning for Video Language Understanding.
Preprint: arxiv.org/abs/2507.00469
Code: github.com/cruiseresear...
Inspired by the rapid binding and pattern separation mechanisms in the hippocampus
florasalim.bsky.social
Multimodal CL #postdocjob #jobs 🦘🎓
florasalim.bsky.social
* Potential for LLM steering:  The research explored the potential to manipulate ToM-related information within the LLMs to generate more aligned and contextually appropriate responses.

The first author - 1st year student Mehdi Jafari is attending his first academic conference #ACL2025.
florasalim.bsky.social
* ToM-informed Alignment Improves Response Quality: Empirical evaluations of LLaMA-3 models (3B and 8B) demonstrated that incorporating ToM principles into the conversational agents improved response quality significantly, achieving win rates of 63% and 67% respectively.
florasalim.bsky.social
Key Findings:
* LLMs Can Represent and Retain ToM-related Constructs: The study investigated whether LLMs could represent and retain ToM-related constructs and found evidence supporting this ability.

* ToM-informed Alignment Improves Response Quality:
florasalim.bsky.social
In Beyond Words, we explore:
a) The extent to which the activation space of LLMs represents ToM of interlocutors,
b) Whether these representations form a consistent model of ToM,
and
c) How can we leverage ToM-related features to generate more aligned responses?
florasalim.bsky.social
Current LLMs often generate contextually appropriate responses, but they don’t truly understand the user's goals, beliefs, or misunderstandings. 
Using ToM, we can analyse interlocutor behaviours based on the understanding of their mental and emotional states.
florasalim.bsky.social
Beyond Words: Integrating Theory of Mind into Conversational Agents for Human-Like Belief, Desire, and Intention Alignment (#ACL2025 Findings)

aclanthology.org/2025.finding...

Codes: github.com/cruiseresear...

Findings in the thread below.

Reposted by: Flora D. Salim

abeirami.bsky.social
If you reviewed for #ICLR, please make sure to read other reviewers' comments too and reflect on whether you may have missed something.

The paper will need to have a single decision; the point of this exercise is not just about addressing each reviewer's concerns individually.

Reposted by: Flora D. Salim

florasalim.bsky.social
Overleaf is down. Ah well... theoretically, only 30,000+ people hammering it 2 days before NeurIPS deadline.

Reposted by: Flora D. Salim

Reposted by: Flora D. Salim

florasalim.bsky.social
NVIDIA Dynamo. Exciting announcement by Jensen Huang #gtc2025. A new open source VMWare-like inference framework for reasoning. It breaks up prefill and decode steps efficiently. DeepSeek R1 requests can be boosted by 30x. Already used by Perplexity, Meta, etc
developer.nvidia.com/blog/introdu...
florasalim.bsky.social
I’m in San Jose this week for NVIDIA GTC. I’m a panelist for “The Role of AI and Accelerated Computing in Understanding and Mitigating Urban Climate Change” session.
We'll discuss how AI transforms climate modeling, weather forecasting, and high-resolution urban simulations.
Anyone else attending?
florasalim.bsky.social
We know many cases of automated hiring gone wrong in the past. Automated firing? Doesn’t look good. No guarantee on transparency. The agency will be left to an unnamed LLM: “info would be fed into an unspecified LLM that would assess whether an employee was necessary” www.wired.com/story/doge-a...
DOGE Is Working on Software That Automates the Firing of Government Workers
Operatives working for Elon Musk’s DOGE appear to be editing the code of AutoRIF—software designed by the Defense Department that could assist in mass firings of federal workers, sources tell WIRED.
www.wired.com
erictopol.bsky.social
Francis Collins, the NIH Director for 12 years, led the Human Genome Project and other NIH efforts for 32 years, resigned today. Key words from his resignation letter
www.nytimes.com/2025/03/01/u...
florasalim.bsky.social
The Brick-by-Brick 2024 challenge focuses only on the multi-label classification problem, which we consider to be harder, and the holy grail for automation and management of net-zero and sustainable buildings.
Round 2 of Brick by Brick 2024 has commenced! To join: www.aicrowd.com/challenges/b...
AIcrowd | Brick by Brick 2024 | Challenges
Automating Building Data Classification
www.aicrowd.com
florasalim.bsky.social
The task also tackles issues like imbalanced data and sparse labels, all while addressing real-world problems like building optimization and sustainability.
Our NeurIPS 2024 paper includes both a multi-label classification benchmark and a zero-shot forecasting benchmark.
neurips.cc/virtual/2024...
NeurIPS Poster Building Timeseries Dataset: Empowering Large-Scale Building AnalyticsNeurIPS 2024
neurips.cc
florasalim.bsky.social
BTS is more challenging than existing TS datasets and a lot more interesting because BTS captures the complexities of real-world operations: 1) Temporal Irregularity; 2) Spatial Heterogeneity; 3) Long-tail Distribution.
-- requires models to manage hierarchical dependencies and ensure consistency.
florasalim.bsky.social
** Knowledge Graph (KG)
BTS also includes a KG that captures the relationships between TS and their physical, logical, and virtual entities.
Making it a great case for Hierarchical Multi-Label Classification. The TS are to be classified across nested categories (e.g. Point>Sensor>Air Quality>CO2).

References

Fields & subjects

Updated 1m