NYU Internship
So, I have finally completed my internship at NYU Tandon for this summers and am now returning back on my flight. I was at the Centre for Cyber Security under Prof. Justin Cappos and working on their in-house Supply Chain Security Tool - in-toto.
I’ll start with my experience in the city then bore you with the project.
The city
New York City is an amazing city. It has tonnes of museums to visit and enjoy and epic skyscrapers. The focus on preserving art, culture and history is something I’ve never seen before.
The diversity in the food is awesome. I had Mexican, Chinese, Japanese, Korean as well as Italian cuisine.
While working on the project, I got the oppurtunity to visit the HQ of a software company and the experience was awesome.
The Project
in-toto
in-toto is a powerful tool which allows specifying supply chain steps in detail, cryptographically signing them and later on verifying them independently using a generated link metadata file. Using it, companies and organizations can easily secure their supply chains and clients using their products can verify the authenticity of what they are using.
I worked on adding some minor features in in-toto such as adding support for using ED25519 Keys for signing metadata instead of just RSA Keys.
Debian Rebuilder
This was the major focus of my summers. The people at reproducible builds and debian are working very hard to ensure that all packages in Debian are reproducible. Once they have rebuilt any such package, they publish the buildinfo file for others to see.
Reproducible builds are very important for getting verifiability in a package distribution ecosystem because if the build system on which the package was built initially is compromised, nobody would know otherwise. With reproducible builds, we can have multiple independent entities build a package and then compare them to ensure that nothing was compromised.
For this purpose, the people on debian were working on an easily deployable system which can be used to rebuild packages and publish the buildinfo generated by this system and establish a baseline.
For what? Fidelity.
I worked majorly on setting up this system wherein, given a buildinfo file, the system would rebuild the whole package from source and then publish the buildinfo. We took it a step further and added in-toto metadata generation during the build and exposed the metadata.
An organization can just deploy the system with ansible playbook.yml
and have the entire
infrastructure exposing the buildinfo as well as metadata running. Now, theoretically, anyone
installing a package from the debian repositories could lookup the exposed metadata and the metadata
of what they have downloaded to ensure that the package hasn’t been compromised. This could even
be automated by using an apt-transport/plugin to automatically verify and only install a
package if a certain threshold is met.
Some work on this is still left. The code and documentation can be found at salsa.debian.org