"Texera: Cloud-Based Collaborative Data Science and AI/ML Using Workflows" IEEE OC CS & SSCS and OC ACM Meeting

#texera #cloud-based-data-science #collaborative-data-science #data-science-with-AI/ML #data-science-using-workflows #data-science #AI #ML
Share

Please fill out the three question Survey (last link under "Hosts") for IEEE attendance planning and the option to view a recording of the presentation,


Event Details  

Since 2016 our team at UC Irvine has been developing the Texera open-source system (texera.io), with the goal of enabling a cloud-based platform to support collaborative data science, AI, and ML. It allows users with various backgrounds, including those with limited coding skills, domain scientists, and ML experts, to conduct AI-centric data science with a collaboration experience similar to Google Docs.

After eight years of development, the system has a rich set of features, such as shared editing, shared execution, version control, commenting, debugging, user-defined functions in multiple languages (e.g., Python, R, Java), and support of state-of-the-art AI/ML techniques. Its backend parallel engine enables scalable computation on large data sets using computing clusters. It allows bioinformaticians to elastically request resources from AWS to form a cluster to run computationally intensive jobs. It also supports community-based sharing of resources including datasets and workflows.

In this talk, we will give an overview of the system, discuss research challenges encountered in the development and our solutions, and give an overview of its software architecture to achieve high usability, scalability, and reliability. We will show several domains that adopt the systems, including education and scientific

About the Speakers, Prof. Chen Li & Jiadong Bai

Prof. Chen Li is a professor in the Department of Computer Science at UC Irvine. He received his Ph.D. degree in Computer Science from Stanford University, and his M.S. and B.S. in Computer Science from Tsinghua University, China. His research interests are in the fields of data management, data science, AI/ML, databases, data-intensive computing, search, and visualization. He was a co-founder and CTO of a startup to commercialize his research. He was a recipient of recognitions including an NSF CAREER award and test-of-time awards. He is an ACM Distinguished Member, and an IEEE fellow.

Jiadong Bai is a second-year Ph.D. student in Computer Science at UC Irvine, advised by Prof. Chen Li. His research focuses on building intelligent systems for big data management, with interests in integrating AI into different aspects of data science. In the Texera project, he is a lead designer of its cloud infrastructure, and the storage

NOTE:  This is an IN PERSON meeting and you should register if you plan on attending.  Parking in the near by structure will be validated.  To receive a link to the event's recording, complete the survey referenced in the header or footer; you do not need to register to receive the link to the recording.



  Date and Time

  Location

  Hosts

  Registration



  • Date: 22 May 2025
  • Time: 01:30 AM UTC to 03:30 AM UTC
  • Add_To_Calendar_icon Add Event to Calendar

  • Contact Event Hosts
  • Co-sponsored by OC ACM [Actual Host] and Knobbe Martens, an Intellectual Property & Technology law firm [Physical Host Donor]
  • Survey: Fill out the survey






Agenda

6:30 PM Networking at physical meeting location
7:00 PM Announcements and Presentation with Q&A
8:00 PM Follow-up quesitons for presenter and networking
8:30 PM Meeting Adjourned



Please fill out the three question Survey for IEEE attendance (last link under "Hosts") and a link to the recording when available.