-
Notifications
You must be signed in to change notification settings - Fork 381
Add GSoC 2026 introductory blog post for Soumyadip Niyogi #1929
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,42 @@ | ||
| --- | ||
| project: HSF | ||
| title: Integration of CMS Combine with FCCAnalyses | ||
| author: Soumyadip Niyogi | ||
| photo: blog_authors/SoumyadipNiyogi.jpg # Upload your square photo to the blog_authors directory | ||
| avatar: https://avatars.githubusercontent.com/captainvogon # Replace with your GitHub avatar URL | ||
| date: 19.05.2026 | ||
| year: 2026 | ||
| layout: blog_post | ||
| logo: hsf_logo_angled.png | ||
| intro: | | ||
| This summer, as a Google Summer of Code contributor with the HEP Software Foundation at CERN, I am bridging the gap between FCCAnalyses and CMS Combine. My project aims to build a native Python interface that automates the generation of datacards and RooFit workspaces directly from RDataFrame histograms, streamlining the path from simulated events to physics results. | ||
| --- | ||
|
|
||
| | | | | ||
| | --- | --- | | ||
| | Name | [Soumyadip Niyogi](https://github.com/captainvogon) | | ||
| | Organisation | [IISER Thiruvananthapuram](https://www.iisertvm.ac.in/), [CERN](https://home.cern/), [HSF](https://hepsoftwarefoundation.org/) | | ||
| | Mentor | [Juraj Smiesko](https://github.com/kjvbrt) (CERN), [Jan Eysermans](https://github.com/jeyserma) (MIT) | | ||
| | Project | [Integration of CMS Combine with FCCAnalyses](https://summerofcode.withgoogle.com/programs/2026/projects/FyDjzZ3Y) | | ||
|
|
||
| ## Introduction | ||
| Hi! I am Soumyadip Niyogi, a third-year BS-MS Physics student at IISER Thiruvananthapuram, India, and this summer I will be working with the HEP Software Foundation as a Google Summer of Code contributor at CERN. | ||
|
|
||
| My research so far has lived mostly in cosmology: MCMC-based constraints on modified gravity, Bayesian inference on Gamma-Ray Burst datasets, and simulating relativistic electrons in galaxy cluster magnetic fields. This project is my first serious step into collider physics, and I could not have asked for a better starting point. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Can you put some link which explains "MCMC-based constraints" |
||
|
|
||
| ## The Project | ||
| The Future Circular Collider (FCC) programme at CERN uses FCCAnalyses as its primary analysis framework. Built on ROOT's RDataFrame, it efficiently processes simulated EDM4hep collision events into histograms. But the next step, turning those histograms into actual physics results like discovery significances or exclusion limits, requires CMS Combine. Combine is a separate statistical tool that FCC physicists currently have to interface with manually. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. "EDM4hep collision" -> "EDM4hep based", can you provide EDM4hep link here? "requires CMS Combine" -> "requires a statistical fitting tool, like CMS Combine" (just to express that CMS Combine is not the only option). |
||
|
|
||
| That manual step is what I am here to automate. By the end of the summer, the goal is to have a native Python interface inside FCCAnalyses that reads RDataFrame output histograms and automatically generates the datacards and RooFit workspaces that Combine needs. The first piece of work is packaging Combine itself inside the Key4hep software stack so it is available as a standard dependency, requiring no separate installation. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Key4hep link? |
||
|
|
||
| ## Community Bonding: Getting Up to Speed | ||
| I have been using the community bonding period to understand the landscape on both sides of the bridge I am building. | ||
|
|
||
| On the FCCAnalyses side, I have been working through the benchmark $Z(\mu\mu)H(bb)$ analysis tutorial. This involved configuring the histmaker, stacking signal and background processes with the plotting utilities, and migrating my workspace from AFS to EOS to handle larger ROOT files. Generating recoil mass plots and studying b-tagging score distributions has given me a concrete picture of exactly what data structures my integration code will need to consume. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. "configuring the histmaker" -> "executing the histmaker style analysis". Will this |
||
|
|
||
| On the Combine side, I have been reading through the datacard format documentation and studying how RooFit workspaces encode shape systematics-the machinery that will form the core of the later project phases. I also had to tackle my first technical hurdle: setting up the Combine environment on CERN's `lxplus9` clusters without conflicting with the host's Key4hep stack. By isolating the environment using a Singularity container (`--cleanenv`) and building a local CMSSW workspace, I successfully compiled the tool and ran my first asymptotic limits check! | ||
|
captainvogon marked this conversation as resolved.
|
||
|
|
||
| I will also be attending the group meeting with the FCCAnalyses team to introduce myself to everyone else, which should be highly beneficial for my future workflow and collaboration. | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. its meeting of the whole CERN EP-FCC group, not just FCCAnalyses :) |
||
|
|
||
| ## Acknowledgements | ||
| Thank you to my mentors, Juraj Smiesko (CERN) and Jan Eysermans (MIT), for their time and guidance during the bonding period. More updates to follow as the coding phase begins! | ||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you use https://kjvbrt.org instead?