talk-data.com

Speaker

Flavio Villanustre

2 talks

CISSP, CISO & SVP of Technology, LexisNexis Risk Solutions HPCC Systems

Flavio Villanustre, currently serving as the Chief Information Security Officer (CISO) and Senior Vice President of Technology at LexisNexis Risk Solutions, is an esteemed professional renowned for his expertise in cybersecurity and big data innovation. In his key roles at LexisNexis, Flavio has accumulated a wealth of experience in steering information security and technology strategy. His practical approach to addressing challenges in these domains has contributed significantly to the company's success. Flavio is also recognized for his contributions to open-source initiatives, with a notable focus on the HPCC Systems platform—a pragmatic solution for big data analytics. His dedication to advancing this platform has solidified his standing as a respected figure in the expansive field of massive data-intensive computing. With a commitment to practical solutions and a focus on addressing industry challenges, Flavio Villanustre brings a valuable perspective to the intersection of information technology, data management, and cybersecurity. As we welcome him to our conference, we anticipate gaining insights from his experiences that will resonate with our diverse audience.

Bio from: Data Universe 2024

Talks & appearances

This session explores technological advances in analytics, data science, machine learning, AI, and quantum computing. Beginning with an overview of the historical development and current state of these technologies, LexisNexis' Flavio Villanustre will explore how analytics has transformed industries, how data science extracts valuable insights, how machine learning has moved from simple rule-based systems to today's deep learning models, and how quantum computing promises to solve previously intractable problems at speeds unattainable by classical computers.

Attendees will learn about the benefits and associated risks of each technology: increases in efficiency, productivity, and innovation, set against vulnerabilities in privacy, security, and bias, and wider societal concerns such as unemployment. These competing issues underscore the need for responsible, ethical research and deployment of such tools.

Summary

Managing big data projects at scale is a perennial problem, with a wide variety of solutions that have evolved over the past 20 years. One of the early entrants, which predates Hadoop and has since been open sourced, is the HPCC (High Performance Computing Cluster) system. Designed as a fully integrated platform to meet the needs of enterprise-grade analytics, it provides a solution for the full lifecycle of data at massive scale. In this episode Flavio Villanustre, VP of infrastructure and products at HPCC Systems, shares the history of the platform, how it is architected for scale and speed, and the unique solutions that it provides for enterprise-grade data analytics. He also discusses the motivations for open sourcing the platform, the detailed workflow that it enables, and how you can try it for your own projects. This was an interesting view of how a well-engineered product can survive massive evolutionary shifts in the industry while remaining relevant and useful.

Announcements

Hello and welcome to the Data Engineering Podcast, the show about modern data management.

When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With 200Gbit private networking, scalable shared block storage, and a 40Gbit public network, you’ve got everything you need to run a fast, reliable, and bullet-proof data platform. If you need global distribution, they’ve got that covered too with world-wide datacenters including new ones in Toronto and Mumbai. And for your machine learning workloads, they just announced dedicated CPU instances. Go to dataengineeringpodcast.com/linode today to get a $20 credit and launch a new server in under a minute. And don’t forget to thank them for their continued support of this show!

To connect with the startups that are shaping the future and take advantage of the opportunities that they provide, check out Angel List, where you can invest in innovative businesses, find a job, or post a position of your own. Sign up today at dataengineeringpodcast.com/angel and help support this show.

You listen to this show to learn and stay up to date with what’s happening in databases, streaming platforms, big data, and everything else you need to know about modern data management. For even more opportunities to meet, listen, and learn from your peers, you don’t want to miss out on this year’s conference season. We have partnered with organizations such as O’Reilly Media, Dataversity, Corinium Global Intelligence, and Data Council. Upcoming events include the O’Reilly AI conference, the Strata Data conference, the combined events of the Data Architecture Summit and Graphorum, and Data Council in Barcelona. Go to dataengineeringpodcast.com/conferences to learn more about these and other events, and take advantage of our partner discounts to save money when you register today.

Go to dataengineeringpodcast.com to subscribe to the show, sign up for the mailing list, read the show notes, and get in touch.

To help other people find the show, please leave a review on iTunes and tell your friends and co-workers.

Join the community in the new Zulip chat workspace at dataengineeringpodcast.com/chat.

Your host is Tobias Macey and today I’m interviewing Flavio Villanustre about the HPCC Systems project and his work at LexisNexis Risk Solutions.

Interview

Introduction

How did you get involved in the area of data management?

Can you start by describing what the HPCC system is and the problems that you were facing at LexisNexis Risk Solutions which led to its creation?

What was the overall state of the data landscape at the time and what was the motivation for releasing it as open source?

Can you describe the high level architecture of the HPCC Systems platform and some of the ways that the design has changed over the years that it has been maintained?

Given how long the project has been in use, c