Video

Inside the AI Cloud Shift and the Future of Infrastructure

Part of the Who’s Ready for Anything series, this episode features CoreWeave’s Chen Goldberg at NVIDIA GTC 2026. Learn how AI is reshaping the cloud from the ground up and what it takes to move from experimentation to production.

In this video:

  • Why the shift from experimentation to production is defining the next phase of AI
  • How AI workloads are changing requirements for compute, storage, and networking
  • What makes an AI-native cloud different from traditional cloud models
  • Why speed of experimentation drives innovation and competitive advantage

1

00:00:12,346 --> 00:00:13,192

Hey, everyone.

2

00:00:13,192 --> 00:00:16,192

Welcome to the AI Cloud Essentials podcast.

3

00:00:16,462 --> 00:00:18,731

I'm Lisa Martin, your host for the next couple of days.

4

00:00:18,731 --> 00:00:21,808

We are live at GTC. This is day 2.

5

00:00:22,115 --> 00:00:24,808

We are in, as you can see behind me, the buzz of the event.

6

00:00:24,808 --> 00:00:27,808

This gives you a little bit of a glimpse of the energy at GTC.

7

00:00:28,154 --> 00:00:31,500

It is electric. I’m so thrilled to be joined by Chen Goldberg.

8

00:00:31,769 --> 00:00:34,654

She's the EVP of Product & Engineering at CoreWeave.

9

00:00:34,654 --> 00:00:36,462

We're going to be talking to you a little bit about what

10

00:00:36,462 --> 00:00:38,000

Chen has been talking about with the audience,

11

00:00:38,000 --> 00:00:39,462

what she’s been hearing,

12

00:00:39,462 --> 00:00:43,231

and the overall partnership with NVIDIA and CoreWeave.

13

00:00:43,500 --> 00:00:45,038

Chen, it's so great to have you on the podcast.

14

00:00:45,038 --> 00:00:46,154

Thank you so much for having me.

16

00:00:47,308 --> 00:00:48,308

So this is Day 2. Yes.

17

00:00:48,308 --> 00:00:51,731

And the energy yesterday was off the charts.

18

00:00:52,654 --> 00:00:56,231

People were describing this as the AI Super Bowl,

19

00:00:56,385 --> 00:00:59,038

the heartbeat of AI. Tell me a little bit...

20

00:00:59,038 --> 00:01:01,692

You were on stage with Corey yesterday.

21

00:01:01,692 --> 00:01:05,000

What were some of the things that you were sharing, and what stuck out to you

22

00:01:05,000 --> 00:01:08,231

in terms of what the audience is really absorbing?

23

00:01:08,577 --> 00:01:10,692

So this is, for me, the second time as part of CoreWeave

24

00:01:10,692 --> 00:01:12,846

that we are presenting at GTC.

25

00:01:13,423 --> 00:01:15,769

And the big change, right.

26

00:01:15,769 --> 00:01:18,038

If you think about last year,

27

00:01:18,038 --> 00:01:19,692

there was a lot of uncertainty.

28

00:01:19,692 --> 00:01:21,462

Things around, like how inference would be.

29

00:01:21,500 --> 00:01:22,615

What would be the size of the models.

30

00:01:22,615 --> 00:01:26,192

Can you create applications with AI?

31

00:01:26,692 --> 00:01:28,538

And what we’ve seen over the past

32

00:01:28,538 --> 00:01:30,423

12 months has been mind-blowing.

33

00:01:30,654 --> 00:01:32,808

And everybody that is here noticed that.

34

00:01:32,808 --> 00:01:35,808

And I think that's from our perspective, the most exciting thing

35

00:01:36,115 --> 00:01:41,269

is that we, instead of just talking about basic education,

36

00:01:41,500 --> 00:01:45,385

we're talking about how can you move from experimentation to production?

37

00:01:45,692 --> 00:01:46,808

What do you need to do?

38

00:01:46,808 --> 00:01:48,808

How do you bring folks onboard?

39

00:01:48,808 --> 00:01:50,615

As an engineering leader,

40

00:01:50,615 --> 00:01:54,654

there's also a lot of conversation of productivity and the tools you are using.

41

00:01:54,692 --> 00:01:55,462

There are so many things

42

00:01:55,462 --> 00:01:59,192

that have changed over these 12 months, and that's really amazing.

43

00:01:59,385 --> 00:02:03,231

You're speaking... I mean, in AI we speak of, like, six-month,

44

00:02:03,269 --> 00:02:06,462

three-month, 12-month timeframes,

45

00:02:06,462 --> 00:02:08,115

because it is moving...

46

00:02:08,115 --> 00:02:09,692

I can't even... I can't describe the speed. It's amazing.

47

00:02:09,692 --> 00:02:14,308

But yesterday in Jensen's keynote was NVIDIA [heart] CoreWeave.

48

00:02:14,308 --> 00:02:15,731

Yes.

49

00:02:15,731 --> 00:02:17,115

I got chills just seeing that.

50

00:02:17,115 --> 00:02:19,731

And then when Jensen came by the booth,

51

00:02:19,731 --> 00:02:21,154

I heard him say to you and the leaders,

52

00:02:21,154 --> 00:02:22,538

just very genuine things.

53

00:02:22,538 --> 00:02:26,115

Talk about the expansion, the deepening of the NVIDIA CoreWeave

54

00:02:26,115 --> 00:02:28,115

relationship and what that will enable customers

55

00:02:28,115 --> 00:02:31,615

to do to get from what you said, experimentation to production.

56

00:02:32,731 --> 00:02:35,577

That was definitely a very special moment for us, as well.

57

00:02:35,577 --> 00:02:39,154

And we thank Jensen for that.

58

00:02:39,154 --> 00:02:41,423

Because it was a recognition of the hard work that

59

00:02:41,423 --> 00:02:43,692

the entire CoreWeave team and our customers

60

00:02:43,692 --> 00:02:45,577

and the partners and everybody that trusted in us,

61

00:02:45,577 --> 00:02:50,615

went through because, you know...

62

00:02:50,615 --> 00:02:57,269

Everybody thinks of us as, like, a GPU reseller.

63

00:02:57,269 --> 00:03:00,038

And then the recognition that we are leading in this space

64

00:03:00,038 --> 00:03:02,731

and being the AI cloud, was amazing.

65

00:03:03,115 --> 00:03:05,846

And really candidly,

66

00:03:05,846 --> 00:03:10,077

NVIDIA has been an amazing partner and customer.

67

00:03:10,231 --> 00:03:12,885

And we are their customer, as well.

68

00:03:12,885 --> 00:03:16,192

And I would say that there are like a couple of things

69

00:03:16,192 --> 00:03:19,000

that really work well, between our two companies.

70

00:03:19,000 --> 00:03:22,692

One, we are really leaning in.

71

00:03:22,692 --> 00:03:24,846

Okay. We believe in AI.

72

00:03:24,846 --> 00:03:26,885

We have a very similar vision.

73

00:03:26,885 --> 00:03:29,077

And that's really helping.

74

00:03:29,077 --> 00:03:31,538

We are really focused on customers together.

75

00:03:31,538 --> 00:03:33,808

The second thing that I think is really critical,

76

00:03:33,808 --> 00:03:36,808

you know, we talk about how our customers want to experiment.

77

00:03:37,077 --> 00:03:38,769

We like to experiment.

78

00:03:38,769 --> 00:03:40,269

Okay. I think all of us as people,

79

00:03:40,269 --> 00:03:42,692

we need to be humble, knowing, like, hey, we're not great at

80

00:03:42,692 --> 00:03:43,692

predicting the future.

81

00:03:44,808 --> 00:03:45,846

But if we experiment and we see signals

82

00:03:45,846 --> 00:03:49,808

and we move fast, we get to great results.

83

00:03:49,808 --> 00:03:52,808

And NVIDIA also has that kind of culture.

84

00:03:53,192 --> 00:03:57,308

And we are both really wanting to build the best products.

85

00:03:57,846 --> 00:04:00,115

And that's actually what led, you know

86

00:04:00,115 --> 00:04:02,692

we had a big announcement last month.

87

00:04:02,692 --> 00:04:07,231

And you know the media definitely got the investment part,

88

00:04:07,500 --> 00:04:09,154

the billion dollar amount.

89

00:04:09,154 --> 00:04:12,269

But there were more things there that made me really excited.

90

00:04:12,269 --> 00:04:13,654

Tell me.

91

00:04:14,038 --> 00:04:16,615

You know, one thing was, of course, us using

92

00:04:16,615 --> 00:04:21,231

CPUs now from... So Jensen, in his keynote, talked about expanding to new platforms,

93

00:04:21,231 --> 00:04:24,192

so expanding from just GPUs to also CPUs.

94

00:04:24,192 --> 00:04:27,692

And the other thing that we were really excited about is two things.

95

00:04:27,692 --> 00:04:30,038

One is the collaboration.

96

00:04:30,038 --> 00:04:33,769

It was actually telling the world how our teams have been collaborating

97

00:04:34,154 --> 00:04:37,308

on producing reference architecture and improving products.

98

00:04:38,000 --> 00:04:39,077

So that was one part.

99

00:04:39,077 --> 00:04:41,192

And the second part was really about us

100

00:04:42,269 --> 00:04:45,346

getting into the world and offering our services

101

00:04:45,346 --> 00:04:47,192

and our software to other people

102

00:04:47,462 --> 00:04:50,462

in the industry outside of even CoreWeave cloud.

103

00:04:50,577 --> 00:04:52,692

We started that with the acquisition we made

104

00:04:52,692 --> 00:04:55,077

with Weights and Biases, which is already multi-cloud.

105

00:04:55,077 --> 00:04:58,654

But having more and more cross-cloud solutions

106

00:04:58,654 --> 00:05:01,038

is something that we plan to invest more in.

107

00:05:01,346 --> 00:05:02,577

And that's what customers are demanding.

108

00:05:02,577 --> 00:05:03,846

You talked about, basically, the customer obsession

109

00:05:03,846 --> 00:05:08,385

and that symbiosis that you share with NVIDIA.

110

00:05:08,385 --> 00:05:10,692

You said something yesterday on LinkedIn, I stalked you.

111

00:05:10,692 --> 00:05:12,154

And I wanted to get your thoughts behind it.

112

00:05:12,154 --> 00:05:13,538

You were speaking with Corey.

113

00:05:13,538 --> 00:05:15,385

And you said, GTC this year 2026

114

00:05:15,385 --> 00:05:18,385

is about the next great leap.

115

00:05:18,385 --> 00:05:21,692

Tokens powering robots,

116

00:05:21,692 --> 00:05:23,500

energy grids, scientific discovery.

117

00:05:23,500 --> 00:05:24,885

I love that.

118

00:05:24,885 --> 00:05:27,846

You said, that leap needs a factory behind it.

119

00:05:27,846 --> 00:05:29,308

Talk about how NVIDIA is that factory,

120

00:05:29,462 --> 00:05:32,462

and how CoreWeave is an enabler of that.

121

00:05:32,692 --> 00:05:36,577

Back again, this year, what Jensen was talking about on stage,

122

00:05:36,577 --> 00:05:39,500

and also what we hear from people around us,

123

00:05:39,500 --> 00:05:44,231

is that in most companies,

124

00:05:44,231 --> 00:05:47,654

because the tools, by the way, have gotten so great

125

00:05:47,654 --> 00:05:49,462

they start seeing value.

126

00:05:49,462 --> 00:05:52,192

Yesterday on stage we had, for example,

127

00:05:52,192 --> 00:05:53,615

a person from Mercado Libre.

128

00:05:54,077 --> 00:05:56,077

So Sebastian from Mercado Libre,

129

00:05:56,077 --> 00:05:58,038

he joined us and he was telling

130

00:05:58,038 --> 00:06:01,038

the audience how they are planning to

131

00:06:01,846 --> 00:06:05,000

reimagine their search capabilities

132

00:06:05,000 --> 00:06:06,385

in their e-commerce platform.

133

00:06:06,385 --> 00:06:10,000

But even before doing that, he was talking about how even with

134

00:06:10,000 --> 00:06:13,885

small experimentation, they've been getting amazing results already.

135

00:06:14,654 --> 00:06:16,000

And what I love about it is,

136

00:06:16,000 --> 00:06:18,885

you know, like downstairs we have like a physical AI demo.

137

00:06:19,192 --> 00:06:22,192

So you see in different industries, whether it's health,

138

00:06:22,500 --> 00:06:24,885

finance, e-commerce,

139

00:06:24,885 --> 00:06:29,077

media, a lot of areas where we see

140

00:06:29,577 --> 00:06:33,346

starting from small experimentation to bigger opportunities.

141

00:06:34,038 --> 00:06:37,192

And that's really what NVIDIA is talking about.

142

00:06:37,192 --> 00:06:40,308

And part of what I think NVIDIA again, was really highlighting, Jensen

143

00:06:40,308 --> 00:06:44,385

was highlighting yesterday, that it's not just NVIDIA on its own.

144

00:06:44,808 --> 00:06:48,538

Jensen has been really investing in building an ecosystem.

145

00:06:49,308 --> 00:06:52,154

And I really appreciate that.

146

00:06:52,154 --> 00:06:55,000

I was actually part of the cloud

147

00:06:55,000 --> 00:06:59,077

transformation in the industry and the ecosystem was key to that.

148

00:06:59,462 --> 00:07:02,462

And I think Jensen is definitely recognizing

149

00:07:02,731 --> 00:07:05,731

and we are participating across the board,

150

00:07:06,000 --> 00:07:08,808

from developer tools to researcher tools

151

00:07:08,808 --> 00:07:13,000

to applications to infrastructure and just building

152

00:07:13,000 --> 00:07:16,846

that momentum, that flywheel, it creates that innovation.

153

00:07:16,885 --> 00:07:20,654

Yes. Well, the validation, the recognition you’re talking about

154

00:07:20,654 --> 00:07:22,308

from Jensen, but also to your point,

155

00:07:22,308 --> 00:07:24,808

and I talk about this on TV all the time,

156

00:07:24,808 --> 00:07:26,846

NVIDIA is not doing this alone.

157

00:07:26,846 --> 00:07:28,731

They are synonymous with AI.

158

00:07:28,731 --> 00:07:31,154

Even people locally around here that are Uber drivers

159

00:07:31,154 --> 00:07:33,000

are asking me; they know AI. They see Jensen and they know AI.

160

00:07:33,000 --> 00:07:34,500

They see Jensen, AI.

161

00:07:34,500 --> 00:07:36,385

But it's the ecosystem

162

00:07:36,385 --> 00:07:39,462

and NVIDIA seems to really respect that and acknowledge it,

163

00:07:39,462 --> 00:07:42,769

obviously, with the NVIDIA [heart] CoreWeave.

164

00:07:42,769 --> 00:07:46,692

But in terms of, like, differentiation, there are a lot of companies here.

165

00:07:47,269 --> 00:07:51,192

The energy at this conference, I've been to a lot of them, as have you, is next level.

166

00:07:51,192 --> 00:07:53,769

But there's a lot of, I don't want to say "me too,"

167

00:07:53,769 --> 00:07:56,423

but a lot of people saying, "We’re the AI cloud."

168

00:07:56,423 --> 00:07:59,192

What makes CoreWeave really stand out

169

00:07:59,192 --> 00:08:03,154

as the essential cloud for AI in 2026 and beyond?

170

00:08:04,577 --> 00:08:05,808

I think that a year ago

171

00:08:05,808 --> 00:08:08,769

people were not sure that we need an AI cloud.

172

00:08:09,115 --> 00:08:12,500

But now folks, once they are trying to experiment more,

173

00:08:12,538 --> 00:08:14,077

they see there is a need.

174

00:08:14,077 --> 00:08:17,423

And one of the things, you know, and maybe I can tell you why I joined CoreWeave.

175

00:08:17,654 --> 00:08:21,500

So before CoreWeave, I was part of Google Cloud, working

176

00:08:21,500 --> 00:08:24,577

on building that cloud-native ecosystem.

177

00:08:25,385 --> 00:08:28,385

And back in the day,

178

00:08:28,462 --> 00:08:31,808

when we talked about the value of the cloud,

179

00:08:32,308 --> 00:08:35,077

it wasn't just enough to move to the cloud, right?

180

00:08:35,077 --> 00:08:36,269

The lift and shift.

181

00:08:36,269 --> 00:08:38,000

There was a change of ecosystem.

182

00:08:38,000 --> 00:08:39,269

You had to change the way you do tooling,

183

00:08:39,269 --> 00:08:41,538

how you develop, how you deploy, how you manage.

184

00:08:42,538 --> 00:08:44,077

And when I started

185

00:08:44,077 --> 00:08:47,500

experimenting and experiencing more of these new types of workloads,

186

00:08:48,500 --> 00:08:51,500

it felt similar, but actually a much bigger

187

00:08:51,769 --> 00:08:54,769

opportunity to reimagine what cloud should look like.

188

00:08:55,154 --> 00:08:58,115

And I think that's the most unique thing about CoreWeave.

189

00:08:58,462 --> 00:09:01,000

And that from the get go, right,

190

00:09:01,000 --> 00:09:04,077

if you talk with the founders, that's what they were trying to do.

191

00:09:04,231 --> 00:09:09,731

It was not like "me too." Peter, our CTO, they took a step back and said,

192

00:09:09,731 --> 00:09:12,692

like, okay, what are the hard problems we need to solve?

193

00:09:12,846 --> 00:09:15,192

They didn't just look at the other reference architecture.

194

00:09:15,192 --> 00:09:17,038

They actually built from scratch.

195

00:09:17,038 --> 00:09:20,077

And there are some things that we are doing very differently

196

00:09:20,308 --> 00:09:23,308

than others, and that's what really delivered the results.

197

00:09:23,808 --> 00:09:27,346

I think the way that you... I think it's easy to understand

198

00:09:27,346 --> 00:09:30,346

there are like probably three categories that we are doing differently.

199

00:09:30,500 --> 00:09:33,423

The first one is that we

200

00:09:34,885 --> 00:09:37,885

started with a vertically integrated stack.

201

00:09:38,000 --> 00:09:38,346

Okay.

202

00:09:38,346 --> 00:09:38,731

You now

203

00:09:38,731 --> 00:09:42,231

see some of the folks announcing it, but it's like they're trying to retrofit.

204

00:09:42,615 --> 00:09:45,885

That vertical integration was a huge difference.

205

00:09:47,308 --> 00:09:50,308

What we are doing, we understand the complexity of the stack

206

00:09:51,385 --> 00:09:52,385

is huge.

207

00:09:52,385 --> 00:09:55,885

And in order to manage that, you need to allow flexibility

208

00:09:55,885 --> 00:10:00,115

and quick decisions across the stack and knowing how to react to events

209

00:10:00,654 --> 00:10:01,423

lower in the stack.

210

00:10:01,423 --> 00:10:06,346

So we like saying from metal to model, from metal to job.

211

00:10:07,077 --> 00:10:07,500

And right.

212

00:10:07,500 --> 00:10:12,038

Bringing all of that information and really creating a new way to operate

213

00:10:12,654 --> 00:10:15,192

a cloud stack in the AI era,

214

00:10:15,192 --> 00:10:18,577

we differentiate with capabilities like Mission Control

215

00:10:18,846 --> 00:10:22,615

that are both reactive and proactive in solving customer problems.

216

00:10:22,615 --> 00:10:23,885

So that would be one.

217

00:10:23,885 --> 00:10:27,538

The second part is that, even though we were thinking about an integrated stack,

218

00:10:28,115 --> 00:10:32,577

we are still looking for opportunities to optimize specific problems

219

00:10:32,577 --> 00:10:35,231

or bottlenecks. And again, we don't try to solve for everything.

220

00:10:35,231 --> 00:10:37,192

We are really, really focused.

221

00:10:37,192 --> 00:10:41,154

So if you think about AI workloads, one of the challenges is

222

00:10:41,462 --> 00:10:45,346

you want to make sure the GPUs are at high utilization.

223

00:10:45,346 --> 00:10:49,115

They are a very expensive, very important resource.

224

00:10:49,423 --> 00:10:53,308

So we are building, for example, specific solutions that will bring more data

225

00:10:53,808 --> 00:10:57,192

into the GPU with our own distributed caching mechanism, or

226

00:10:57,731 --> 00:11:01,423

building a new solution for orchestration of workloads,

227

00:11:01,654 --> 00:11:05,231

for example, again, for the technical audience folks here that are familiar

228

00:11:05,231 --> 00:11:09,423

with Slurm, Slurm is a known

229

00:11:10,885 --> 00:11:13,038

industry standard for

230

00:11:13,038 --> 00:11:16,038

HPC workloads, high-performance workloads.

231

00:11:16,423 --> 00:11:18,808

But what we've done, we created a solution that brings

232

00:11:18,808 --> 00:11:21,808

those workloads into the cloud-native era.

233

00:11:22,269 --> 00:11:26,654

And the last point, which maybe really brings us back to where we started.

234

00:11:27,231 --> 00:11:30,885

We know when we talk with our customers that speed of experimentation

235

00:11:31,192 --> 00:11:33,115

is the most important thing.

236

00:11:33,115 --> 00:11:36,115

And I'm a believer that we cannot predict the future.

237

00:11:36,692 --> 00:11:40,654

So how can we increase the signals that people see?

238

00:11:41,000 --> 00:11:44,385

Okay, how quickly can I experiment, iterate, and see where I need to go?

239

00:11:44,769 --> 00:11:48,538

And so we have a lot of investment in that, both from an infrastructure

240

00:11:48,538 --> 00:11:52,192

perspective but also on tooling, like we said, W&B Models.

241

00:11:52,192 --> 00:11:56,462

And we're just giving those signals that allow our customers

242

00:11:56,462 --> 00:12:00,308

to make decisions faster and with confidence.

243

00:12:00,654 --> 00:12:03,000

That's the key word: confidence.

244

00:12:03,000 --> 00:12:04,808

Yeah. Because the speed is just

245

00:12:05,846 --> 00:12:07,192

not going to slow down.

246

00:12:07,192 --> 00:12:07,885

One of the things you said

247

00:12:07,885 --> 00:12:11,308

when we were preparing for this conversation was it's not just meetings.

248

00:12:12,000 --> 00:12:12,577

Yeah.

249

00:12:12,577 --> 00:12:16,423

So walk me through the steps for each part of the stack, you started to allude to it

250

00:12:17,038 --> 00:12:21,346

in part, for the technical side, and explain where the differentiation lies

251

00:12:21,538 --> 00:12:24,500

and where are you really enabling customers to build

252

00:12:24,500 --> 00:12:28,269

the AI infrastructure from scratch to power?

253

00:12:28,308 --> 00:12:31,115

Awesome, and thank you so much for asking it.

254

00:12:31,115 --> 00:12:33,654

I think that we are using a lot of jargon all the time,

255

00:12:33,654 --> 00:12:35,423

and people don't always understand.

256

00:12:35,423 --> 00:12:37,538

What does it actually mean to build a cloud?

257

00:12:37,538 --> 00:12:39,269

What does it look like?

258

00:12:39,269 --> 00:12:42,769

And so first of all, there is of course the physical infrastructure.

259

00:12:43,231 --> 00:12:43,654

Okay.

260

00:12:43,654 --> 00:12:48,692

And already, just on that layer, there are a lot of new challenges

261

00:12:48,885 --> 00:12:52,692

that appear in the AI era from a lot of power

262

00:12:52,692 --> 00:12:55,692

consumption, cooling needs.

263

00:12:56,115 --> 00:12:59,115

And there's a lot of work that our team is doing,

264

00:12:59,500 --> 00:13:02,423

around power efficiency,

265

00:13:02,423 --> 00:13:07,538

space, liquid cooling, and just making sure that we build and,

266

00:13:07,538 --> 00:13:12,385

of course, security, across the stack, that we innovate in that space.

267

00:13:12,692 --> 00:13:16,808

But then on top of that, there are actually different layers of the stack.

268

00:13:17,077 --> 00:13:20,692

You know, when we say infrastructure, we should all think about,

269

00:13:21,654 --> 00:13:23,654

starting with what we call infrastructure as a service.

270

00:13:23,654 --> 00:13:26,846

Very simple compute, storage and network.

271

00:13:26,846 --> 00:13:27,538

Right.

272

00:13:27,538 --> 00:13:31,000

What's been very interesting, when you move from the cloud 1.0

273

00:13:31,192 --> 00:13:34,192

to cloud 2.0, is that

274

00:13:35,077 --> 00:13:39,269

in the cloud 1.0 era, the goal was to make

275

00:13:39,538 --> 00:13:43,115

infrastructure, specifically compute, storage and network, boring.

276

00:13:43,808 --> 00:13:44,808

Okay, okay, okay.

277

00:13:44,808 --> 00:13:48,808

As a developer, as a business owner, I should not think about it. Yes.

278

00:13:48,808 --> 00:13:50,308

Should not care about that. Right.

279

00:13:50,308 --> 00:13:52,385

And that's been my journey

280

00:13:52,385 --> 00:13:54,615

in the industry, making that

281

00:13:54,615 --> 00:13:57,615

a reality with technologies like Kubernetes.

282

00:13:57,769 --> 00:14:01,115

Make it invisible, make the infrastructure boring.

283

00:14:01,115 --> 00:14:02,731

You can, like, Google that.

284

00:14:02,731 --> 00:14:05,615

And you will see, like, those kinds of quotes.

285

00:14:05,615 --> 00:14:08,385

And I think we've actually done a great job,

286

00:14:08,385 --> 00:14:11,115

in the industry of creating technologies that have done that.

287

00:14:11,115 --> 00:14:11,808

However,

288

00:14:13,192 --> 00:14:15,654

what's happening right now, and that's exactly what Jensen

289

00:14:15,654 --> 00:14:20,115

was talking about yesterday, is that the workloads, the applications,

290

00:14:21,538 --> 00:14:24,538

they care about the compute, storage, network,

291

00:14:25,654 --> 00:14:27,038

okay, because

292

00:14:27,038 --> 00:14:30,385

the amount of data that has to go on the network is impacting the latency.

293

00:14:30,423 --> 00:14:33,423

And now let's say I have an agent,

294

00:14:33,654 --> 00:14:36,500

this is no longer... it can be a

295

00:14:36,500 --> 00:14:40,000

mission-critical workload where we need someone to immediately respond.

296

00:14:40,000 --> 00:14:43,000

Or if we have video generation, right, it cannot be lagging.

297

00:14:43,308 --> 00:14:46,500

And you think about storage, how quickly can I scale?

298

00:14:46,769 --> 00:14:48,846

How do I get the data, how quickly can I load things?

299

00:14:48,846 --> 00:14:52,808

And from a compute perspective, not only

300

00:14:52,808 --> 00:14:56,654

I want to get the best utilization, we actually have lots of workloads.

301

00:14:56,654 --> 00:14:59,154

I have models that are now running

302

00:14:59,154 --> 00:14:59,577

not on a

303

00:14:59,577 --> 00:15:01,000

one node or ten nodes,

304

00:15:01,000 --> 00:15:04,000

but sometimes tens of thousands of nodes.

305

00:15:04,154 --> 00:15:07,308

And that creates a lot of complexity in that stack.

306

00:15:07,308 --> 00:15:10,154

And we have been innovating in that area.

307

00:15:10,154 --> 00:15:11,269

And then on top of this

308

00:15:12,385 --> 00:15:13,269

we have

309

00:15:13,269 --> 00:15:16,192

new inference services and training services.

310

00:15:16,192 --> 00:15:16,346

Right.

311

00:15:16,346 --> 00:15:21,231

Because as a developer, I have new tasks that I need to perform.

312

00:15:21,423 --> 00:15:24,231

So I need new tools and a new ecosystem.

313

00:15:24,231 --> 00:15:25,192

And maybe on top of that,

314

00:15:25,192 --> 00:15:28,885

I think the thing that really excites me the most is our entire, what we call our...

315

00:15:29,808 --> 00:15:32,231

you know, when we think about serverless, it's not about making

316

00:15:32,231 --> 00:15:35,231

the infrastructure boring.

317

00:15:35,462 --> 00:15:37,885

But helping our customers

318

00:15:37,885 --> 00:15:40,846

rely on us in making some of those decisions.

319

00:15:41,115 --> 00:15:44,115

So one of the things that I'm really excited about is,

320

00:15:44,462 --> 00:15:48,000

the tools that we are providing, it's like what we call that serverless layer,

321

00:15:48,000 --> 00:15:51,231

where we are not trying to really make the infrastructure disappear,

322

00:15:51,769 --> 00:15:55,654

but we're trying to help customers, to rely on us.

323

00:15:56,231 --> 00:15:56,500

Okay.

324

00:15:56,500 --> 00:16:00,385

And let us do the heavy lifting in making those tough decisions.

325

00:16:01,038 --> 00:16:04,308

And we've been announcing this week a new

326

00:16:04,308 --> 00:16:07,846

tool in our serverless RL, which is reinforcement learning.

327

00:16:08,269 --> 00:16:11,731

And in that space, if before a lot of this experimentation

328

00:16:12,038 --> 00:16:16,269

required simulation data, now we are allowing customers to use production

329

00:16:16,269 --> 00:16:21,000

tracing in order to train the agents to get better automatically.

330

00:16:21,346 --> 00:16:24,038

So that would be one example of a token production.

331

00:16:24,038 --> 00:16:25,731

Yes. In production.

332

00:16:25,731 --> 00:16:28,231

Oh, for example, we have a new customer, Klein,

333

00:16:28,231 --> 00:16:29,577

and they are using our inference service.

334

00:16:29,577 --> 00:16:32,577

So they are a customer, a partner as well.

335

00:16:32,846 --> 00:16:33,654

And they are

336

00:16:34,808 --> 00:16:37,192

actually building a coding solution

337

00:16:37,192 --> 00:16:40,192

that allows them to leverage our platform.

338

00:16:40,500 --> 00:16:42,115

Maybe, you know, you asked about differentiation.

339

00:16:42,115 --> 00:16:43,769

I should have led with that.

340

00:16:43,769 --> 00:16:47,269

And we offer all this new stuff

341

00:16:48,000 --> 00:16:52,462

without compromising security, resiliency, and production readiness.

342

00:16:52,846 --> 00:16:55,269

And we take a lot of pride in that.

343

00:16:55,269 --> 00:16:58,346

And that actually would probably be the number one reason

344

00:16:58,346 --> 00:16:59,731

why customers come to us.

345

00:16:59,731 --> 00:17:03,269

And second-to-last question: you talked about some of the challenges

346

00:17:03,269 --> 00:17:08,423

customers have, technical challenges, power, tooling, space, capacity.

347

00:17:09,077 --> 00:17:13,115

What are some of the business challenges that you're proud to help

348

00:17:13,154 --> 00:17:16,154

customers eliminate?

349

00:17:16,577 --> 00:17:17,885

There are many challenges.

350

00:17:17,885 --> 00:17:22,231

And, you know, like, we were just... I had a customer meeting today

351

00:17:22,538 --> 00:17:25,308

and they said, like, you know, every time we solve a hard problem,

352

00:17:25,308 --> 00:17:27,346

a new hard problem pops up.

353

00:17:27,346 --> 00:17:30,577

So, it feels like that's the business that we are in.

354

00:17:31,808 --> 00:17:35,500

But the thing that I feel like we are most helping

355

00:17:35,846 --> 00:17:38,846

customers with, one is speed of innovation.

356

00:17:39,346 --> 00:17:42,269

And the idea that our customers are telling us

357

00:17:42,269 --> 00:17:45,269

that on day one, they can be productive,

358

00:17:45,692 --> 00:17:48,692

that they have the signals that they can make the right decisions,

359

00:17:49,692 --> 00:17:52,000

that we are not wasting their time,

360

00:17:52,000 --> 00:17:54,885

okay, that their team is productive

361

00:17:54,885 --> 00:17:57,077

is really, really important.

362

00:17:57,077 --> 00:17:58,654

That's a competitive advantage for them.

363

00:17:58,654 --> 00:18:01,038

Yes, yes, for sure.

364

00:18:01,038 --> 00:18:04,038

The second thing is that,

365

00:18:06,038 --> 00:18:08,731

we do see our customers as partners.

366

00:18:08,731 --> 00:18:12,615

I think it's a privilege for us to partner

367

00:18:12,615 --> 00:18:16,077

with them and enable this next-gen innovation.

368

00:18:16,885 --> 00:18:18,808

And it looks different. Okay.

369

00:18:18,808 --> 00:18:23,231

It looks in a way that our customers, we are not just talking about

370

00:18:24,385 --> 00:18:27,385

the platform or what services. We are talking to them about

371

00:18:27,462 --> 00:18:30,462

what challenges they have, how can we help.

372

00:18:30,577 --> 00:18:33,462

And if they need us, we'll be there, okay.

373

00:18:33,462 --> 00:18:36,462

We'll be there to help them with whatever they need.

374

00:18:36,577 --> 00:18:40,308

And as part of that state of mind,

375

00:18:40,615 --> 00:18:43,500

it also impacts how we support our customers.

376

00:18:43,500 --> 00:18:48,654

We actually don't really have, like, a traditional support process.

377

00:18:49,154 --> 00:18:52,769

And the way we are working, we believe in engineering

378

00:18:52,769 --> 00:18:57,000

to engineering relationships because our customers are sophisticated.

379

00:18:57,385 --> 00:18:59,000

They have the hardest problems.

380

00:18:59,000 --> 00:19:02,423

They want to know that they can quickly find solutions.

381

00:19:02,885 --> 00:19:05,808

And so we really automate and manage that process.

382

00:19:05,808 --> 00:19:07,231

We call it direct to expert.

383

00:19:08,462 --> 00:19:11,154

And we have engineers working with our customers day

384

00:19:11,154 --> 00:19:14,269

in day out on the hardest problems, same language like

385

00:19:15,269 --> 00:19:18,731

difference from a differentiation perspective and really getting in there

386

00:19:18,731 --> 00:19:22,885

and allowing those experts to deliver to their customers what they expect.

387

00:19:22,885 --> 00:19:23,846

For sure.

388

00:19:23,846 --> 00:19:26,500

So you're not a mind reader, can't predict the future.

389

00:19:26,500 --> 00:19:29,500

But a year from now, GTC 2027,

390

00:19:30,115 --> 00:19:33,115

what do you think we might see?

391

00:19:33,577 --> 00:19:34,615

That's it.

392

00:19:34,615 --> 00:19:36,577

I keep thinking, I keep thinking about it.

393

00:19:36,577 --> 00:19:40,346

If someone would have told me that, like,

394

00:19:40,346 --> 00:19:43,346

2025 would look the way it did, I would not have believed it.

395

00:19:43,462 --> 00:19:43,692

Okay?

396

00:19:43,692 --> 00:19:47,231

There have been so many things that happened that surprised all of us

397

00:19:47,500 --> 00:19:49,346

from an advancement perspective.

398

00:19:49,346 --> 00:19:53,231

However, I do expect that we will see,

399

00:19:54,654 --> 00:19:56,308

an explosion of use cases.

400

00:19:57,885 --> 00:20:03,769

And more and more people will speak the AI language.

401

00:20:04,308 --> 00:20:06,038

It's so powerful.

402

00:20:06,038 --> 00:20:09,038

And the tools are becoming so accessible

403

00:20:09,462 --> 00:20:11,346

that I think that

404

00:20:11,346 --> 00:20:15,308

we will see fewer people be worried about it,

405

00:20:15,308 --> 00:20:18,308

because what we are seeing today is that the people that lean in

406

00:20:19,038 --> 00:20:21,500

find that it is their superpower.

407

00:20:21,500 --> 00:20:23,346

Yeah. Yes.

408

00:20:23,346 --> 00:20:25,000

Yeah. I love that leaning in.

409

00:20:25,000 --> 00:20:28,000

And thank you so much for an outstanding conversation.

410

00:20:28,115 --> 00:20:31,038

I appreciate you taking some time to be on the podcast and share

411

00:20:31,038 --> 00:20:33,769

what you're talking about, what you're hearing from customers,

412

00:20:33,769 --> 00:20:37,077

the strength of the NVIDIA partnership, and also other partnerships

413

00:20:37,308 --> 00:20:41,846

in that ecosystem that really make AI accessible for customers.

414

00:20:41,846 --> 00:20:43,115

I appreciate your time. Thank you.

415

00:20:43,115 --> 00:20:44,885

Thank you so much for having me.

416

00:20:44,885 --> 00:20:47,231

For Chen Goldberg, I'm Lisa Martin. You're watching

417

00:20:47,231 --> 00:20:50,385

the AI Cloud Essentials podcast, live from GTC.

418

00:20:50,654 --> 00:20:52,654

Thanks for watching, guys. We'll see you on the next pod.