What an unprocessed photo looks like

(maurycyz.com)

325 points | by zdw 2 hours ago

21 comments

barishnamazov 1 hour ago
I love posts that peel back the abstraction layer of "images." It really highlights that modern photography is just signal processing with better marketing.
A fun tangent on the "green cast" mentioned in the post: the reason the Bayer pattern is RGGB (50% green) isn't just about color balance, but spatial resolution. The human eye is most sensitive to green light, so that channel effectively carries the majority of the luminance (brightness/detail) data. In many advanced demosaicing algorithms, the pipeline actually reconstructs the green channel first to get a high-resolution luminance map, and then interpolates the red/blue signals—which act more like "color difference" layers—on top of it. We can get away with this because the human visual system is much more forgiving of low-resolution color data than it is of low-resolution brightness data. It’s the same psycho-visual principle that justifies 4:2:0 chroma subsampling in video compression.
Also, for anyone interested in how deep the rabbit hole goes, looking at the source code for dcraw (or libraw) is a rite of passage. It’s impressive how many edge cases exist just to interpret the "raw" voltages from different sensor manufacturers.
[-]
- delecti 1 hour ago
  I have a related anecdote.
  When I worked at Amazon on the Kindle Special Offers team (ads on your eink Kindle while it was sleeping), the first implementation of auto-generated ads was by someone who didn't know that properly converting RGB to grayscale was a smidge more complicated than just averaging the RGB channels. So for ~6 months in 2015ish, you may have seen a bunch of ads that looked pretty rough. I think I just needed to add a flag to the FFmpeg call to get it to convert RGB to luminance before mapping it to the 4-bit grayscale needed.
  [-]
  - barishnamazov 50 minutes ago
    I don't think Kindle ads were available in my region in 2015 because I don't remember seeing these back then, but you're a lucky one to fix this classic mistake :-)
    I remember trying out some of the home-made methods while I was implementing a creative work section for a school assignment. It’s surprising how "flat" the basic average looks until you actually respect the coefficients (usually some flavor of 0.21R + 0.72G + 0.07B). I bet it's even more apparent in a 4-bit display.
    [-]
    - kccqzy 16 minutes ago
      I remember using some photo editing software (Aperture I think) that would allow you to customize the different coefficients and there were even presets that give different names to different coefficients. Ultimately you can pick any coefficients you want, and only your eyes can judge how nice they are.
    - reactordev 34 minutes ago
      If you really want that old school NTSC look: 0.3R + 0.59G + 0.11B
      This is the coefficients I use regularly.
- thousand_nights 26 minutes ago
  the bayer pattern is one of those things that makes me irrationally angry, in the true sense, based on my ignorance of the subject
  what's so special about green? oh so just because our eyes are more sensitive to green we should dedicate double the area to green in camera sensors? i mean, probably yes. but still. (⩺_⩹)
- dheera 45 minutes ago
  This is also why I absolute hate, hate, hate it when people ask me whether I "edited" a photo or whether a photo is "original", as if trying to explain away nice-looking images as if they are fake.
  The JPEGs cameras produce are heavily processed, and they are emphatically NOT "original". Taking manual control of that process to produce an alternative JPEG with different curves, mappings, calibrations, is not a crime.
  [-]
  - to11mtm 29 minutes ago
    JPEG with OOC processing is different from JPEG OOPC (out-of-phone-camera) processing. Thank Samsung for forcing the need to differentiate.
    [-]
    - seba_dos1 15 minutes ago
      I wrote the raw Bayer to JPEG pipeline used by the phone I write this comment on. The choices on how to interpret the data are mine. Can I tweak these afterwards? :)
- bstsb 42 minutes ago
  hey, not accusing you of anything (bad assumptions don't lead to a conducive conversation) but did you use AI to write or assist with this comment?
  this is totally out of my own self-interest, no problems with its content
  [-]
  - sho_hn 33 minutes ago
    Upon inspection, the author's personal website used em dashes in 2023. I hope this helped with your witch hunt.
    I'm imagining a sort of Logan's Run-like scifi setup where only people with a documented em dash before November 30, 2022, i.e. D(ash)-day, are left with permission to write.
  - ekidd 20 minutes ago
    I have been overusing em dashes and bulleted lists since the actual 80s, I'm sad to say. I spent much of the 90s manually typing "smart" quotes.
    I have actually been deliberately modifying my long-time writing style and use of punctuation to look less like an LLM. I'm not sure how I feel about this.
    [-]
    - disillusioned 11 minutes ago
      Alt + 0151, baby! Or... however you do it on MacOS.
      But now, likewise, having to bail on emdashes. My last differentiator is that I always close set the emdash—no spaces on either side, whereas ChatGPT typically opens them (AP Style).
      [-]
      - piskov 5 minutes ago
        Just use some typography layout with a separate layer. Eg “right alt” plus “-” for m-dash
        Russians use this for at 20 years
        https://ilyabirman.ru/typography-layout/
  - ajkjk 35 minutes ago
    found the guy who didn't know about em dashes before this year
    also your question implies a bad assumption even if you disclaim it. if you don't want to imply a bad assumption the way to do that is to not say the words, not disclaim them
    [-]
    - reactordev 32 minutes ago
      The hatred mostly comes from TTS models not properly pausing for them.
      “NO EM DASHES” is common system prompt behavior.
Waterluvian 39 minutes ago
I studied remote sensing in undergrad and it really helped me grok sensors and signal processing. My favourite mental model revelation to come from it was that what I see isn’t the “ground truth.” It’s a view of a subset of the data. My eyes, my cat’s eyes, my cameras all collect and render different subsets of the data, providing different views of the subject matter.
It gets even wilder when perceiving space and time as additional signal dimensions.
I imagine a sort of absolute reality that is the universe. And we’re all just sensor systems observing tiny bits of it in different and often overlapping ways.
userbinator 1 hour ago
I think everyone agrees that dynamic range compression and de-Bayering (for sensors which are colour-filtered) are necessary for digital photography, but at the other end of the spectrum is "use AI to recognise objects and hallucinate what they 'should' look like" --- and despite how everyone would probably say that isn't a real photo anymore, it seems manufacturers are pushing strongly in that direction, raising issues with things like admissibility of evidence.
[-]
- stavros 1 hour ago
  One thing I've learned while dabbling in photography is that there are no "fake" images, because there are no "real" images. Everything is an interpretation of the data that the camera has to do, making a thousand choices along the way, as this post beautifully demonstrates.
  A better discriminator might be global edits vs local edits, with local edits being things like retouching specific parts of the image to make desired changes, and one could argue that local edits are "more fake" than global edits, but it still depends on a thousand factors, most importantly intent.
  "Fake" images are images with intent to deceive. By that definition, even an image that came straight out of the camera can be "fake" if it's showing something other than what it's purported to (e.g. a real photo of police violence but with a label saying it's in a different country is a fake photo).
  What most people think when they say "fake", though, is a photo that has had filters applied, which makes zero sense. As the post shows, all photos have filters applied. We should get over that specific editing process, it's no more fake than anything else.
  [-]
  - mmooss 59 minutes ago
    > What most people think when they say "fake", though, is a photo that has had filters applied, which makes zero sense. As the post shows, all photos have filters applied.
    Filters themselves don't make it fake, just like words themselves don't make something a lie. How the filters and words are used, whether they bring us closer or further from some truth, is what makes the difference.
    Photos implicitly convey, usually, 'this is what you would see if you were there'. Obviously filters can help with that, as in the OP, or hurt.
  - melagonster 5 minutes ago
    Today, I trust the other meaning of "fake images" is that an image was generated by AI.
  - userbinator 1 hour ago
    Everything is an interpretation of the data that the camera has to do
    What about this? https://news.ycombinator.com/item?id=35107601
    [-]
    - mrandish 38 minutes ago
      News agencies like AP have already come up with technical standards and guidelines to technically define 'acceptable' types and degrees of image processing applied to professional photo-journalism.
      You can look it up because it's published on the web but IIRC it's generally what you'd expect. It's okay to do whole-image processing where all pixels have the same algorithm applied like the basic brightness, contrast, color, tint, gamma, levels, cropping, scaling, etc filters that have been standard for decades. The usual debayering and color space conversions are also fine. Selectively removing, adding or changing only some pixels or objects is generally not okay for journalistic purposes. Obviously, per-object AI enhancement of the type many mobile phones and social media apps apply by default don't meet such standards.
    - mgraczyk 19 minutes ago
      I think Samsung was doing what was alleged, but as somebody who was working on state of the art algorithms for camera processing at a competitor while this was happening, this experiment does not prove what is alleged. Gaussian blurring does not remove the information, you can deconvolve and it's possible that Samsung's pre-ML super resolution was essentially the same as inverting a gaussian convolution
  - xgulfie 1 hour ago
    There's an obvious difference between debayering and white balance vs using Photoshop's generative fill
    [-]
    - sho_hn 29 minutes ago
      Pretending that "these two things are the same, actually" when in fact no, you can seperately name and describe them quite clearly, is a favorite pastime of vacuous content on the internet.
      Artists, who use these tools with clear vision and intent to achieve specific goals, strangely never have this problem.
  - to11mtm 52 minutes ago
    Well that's why back in the day (and even still) 'Photographer listing their whole kit for every shot' is a thing thing you sometimes see.
    i.e. Camera+Lens+ISO+SS+FStop+FL+TC (If present)+Filter (If present). Add focus distance if being super duper proper.
    And some of that is to help at least provide the right requirements to try to recreate.
  - teeray 1 hour ago
    > "Fake" images are images with intent to deceive
    The ones that make the annual rounds up here in New England are those foliage photos with saturation jacked. “Look at how amazing it was!” They’re easy to spot since doing that usually wildly blows out the blues in the photo unless you know enough to selectively pull those back.
    [-]
    - dheera 40 minutes ago
      Photography is also an art. When painters jack up saturations in their choices of paint colors people don't bat an eyelid. There's no good reason photographers cannot take that liberty as well, and tone mapping choices is in fact a big part of photographers' expressive medium.
      If you want reality, go there in person and stop looking at photos. Viewing imagery is a fundamentally different type of experience.
      [-]
      - zmgsabst 24 minutes ago
        Sure — but people reasonably distinguish between photos and digital art, with “photo” used to denote the intent to accurately convey rather than artistic expression.
        We’ve had similar debates about art using miniatures and lens distortions versus photos since photography was invented — and digital editing fell on the lens trick and miniature side of the issue.
        [-]
        dheera 18 minutes ago
        Journalistic/event photography is about accuracy to reality, almost all other types of photography are not.
        Portrait photography -- no, people don't look like that in real life with skin flaws edited out
        Landscape photography -- no, the landscapes don't look like that 99% of the time, the photographer picks the 1% of the time when it looks surreal
        Staged photography -- no, it didn't really happen
        Street photography -- a lot of it is staged spontaneously
        Product photography -- no, they don't look like that in normal lighting
  - nospice 1 hour ago
    > A better discriminator might be global edits vs local edits,
    Even that isn't all that clear-cut. Is noise removal a local edit? It only touches some pixels, but obviously, that's a silly take.
    Is automated dust removal still global? The same idea, just a bit more selective. If we let it slide, what about automated skin blemish removal? Depth map + relighting, de-hazing, or fake bokeh? I think that modern image processing techniques really blur the distinction here because many edits that would previously need to be done selectively by hand are now a "global" filter that's a single keypress away.
    Intent is the defining factor, as you note, but intent is... often hazy. If you dial down the exposure to make the photo more dramatic / more sinister, you're manipulating emotions too. Yet, that kind of editing is perfectly OK in photojournalism. Adding or removing elements for dramatic effect? Not so much.
    [-]
    - card_zero 41 minutes ago
      What's this, special pleading for doctored photos?
      The only process in the article that involves nearby pixels is to combine R G and B (and other G) into one screen pixel. (In principle these could be mapped to subpixels.) Everything fancier than that can be reasonably called some fake cosmetic bullshit.
      [-]
      - seba_dos1 5 minutes ago
        The article doesn't even go anywhere near what you need to do in order to get an acceptable output. It only shows the absolute basics. If you apply only those to a photo from a phone camera, it will be massively distorted (the effect is smaller, but still present on big cameras).
      - nospice 31 minutes ago
        I honestly don't understand what you're saying here.
        [-]
        card_zero 24 minutes ago
        I can't see how to rephrase it. How about this:
        Removing dust and blemishes entails looking at more than one pixel at a time.
        Nothing in the basic processing described in the article does that.
  - imiric 1 hour ago
    I understand what you and the article are saying, but what GP is getting at, and what I agree with, is that there is a difference between a photo that attempts to reproduce what the "average" human sees, and digital processing that augments the image in ways that no human could possibly visualize. Sometimes we create "fake" images to improve clarity, detail, etc., but that's still less "fake" than smoothing skin to remove blemishes, or removing background objects. One is clearly a closer approximation of how we perceive reality than the other.
    So there are levels of image processing, and it would be wrong to dump them all in the same category.
krackers 1 hour ago
>if the linear data is displayed directly, it will appear much darker then it should be.
This seems more a limitation of monitors. If you had very large bit depth, couldn't you just display images in linear light without gamma correction.
[-]
- Sharlin 18 minutes ago
  No. It's about the shape of the curve. Human light intensity perception is not linear. You have to nonlinearize at some point of the pipeline, but yes, typically you should use high-resolution (>=16 bits per channel) linear color in calculations and apply the gamma curve just before display. The fact that traditionally this was not done, and linear operations like blending were applied to nonlinear RGB values, resulted in ugly dark, muddy bands of intermediate colors even in high-end applications like Photoshop.
- AlotOfReading 1 hour ago
  Correction is useful for a bunch of different reasons, not all of them related to monitors. Even ISP pipelines without displays involved will still usually do it to allocate more bits to the highlights/shadows than the relatively distinguishable middle bits. Old CRTs did it because the electron gun had a non-linear response and the gamma curve actually linearized the output. Film processing and logarithmic CMOS sensors do it because the sensing medium has a nonlinear sensitivity to the light level.
- dheera 32 minutes ago
  If we're talking about a sunset, then we're talking about your monitor shooting out blinding, eye-hurting brightness light wherever the sun is in the image. That wouldn't be very pleasant.
yoonwoosik12 3 minutes ago
This is really interesting. I'll be back after reading it.
exabrial 34 minutes ago
I love the look of the final product after the manual work (not the one for comparison). Just something very realistic and wholesome about it, not pumped to 10 via AI or Instagram filters.
strogonoff 27 minutes ago
An unprocessed photo does not “look”. It is RGGB pixel values that far exceed any display media in dynamic range. Fitting it into the tiny dynamic range of screens by thrusting throwing away data strategically (inventing perceptual the neutral grey point, etc.) is what actually makes sense of them, and what is the creative task.
ChrisMarshallNY 16 minutes ago
That's a cool walkthrough.
I spent a good part of my career, working in image processing.
That first image is pretty much exactly what a raw Bayer format looks like.
throw310822 1 hour ago
Very interesting, pity the author chose such a poor example for the explanation (low, artificial and multicoloured light), making it really hard to understand what the "ground truth" and expected result should be.
[-]
- delecti 1 hour ago
  I'm not sure I understand your complaint. The "expected result" is either of the last two images (depending on your preference), and one of the main points of the post is to challenge the notion of "ground truth" in the first place.
  [-]
  - throw310822 1 hour ago
    Not a complaint, but both the final images have poor contrast, lighting, saturation and colour balance, making them a disappointing target for an explanation of how these elements are produced from raw sensor data.
    But anyway, I enjoyed the article.
jonplackett 15 minutes ago
The matrix step has 90s video game pixel art vibes.
reactordev 38 minutes ago
Maybe it’s just me but I took one look at the unprocessed photo (the first one) and immediately knew it was a skinny Christmas tree.
I’ve been staring at 16-bit HDR greyscale space for so long…
uolmir 1 hour ago
This is a great write up. It's also weirdly similar to a video I happened upon yesterday playing around with raw Hubble imagery: https://www.youtube.com/watch?v=1gBXSQCWdSI
He take a few minutes to get to the punch line. Feel free to skip ahead to around 5:30.
emodendroket 1 hour ago
This is actually really useful. A lot of people demand an "unprocessed" photo but don't understand what they're actually asking for.
XCSme 1 hour ago
I am confused by the color filter step.
Is the output produced by the sensor RGB or a single value per pixel?
[-]
- steveBK123 1 hour ago
  In its most raw form, camera sensors only see illumination not color.
  In front of the sensor is a bayer filter which results in each physical pixel seeing illumination filtered R G or B.
  From there the software onboard the camera or in your RAW converter does interpolation to create RGB values at each pixel. For example if the local pixel is R filtered, it then interpolates its G & B values from nearby pixels of that filter.
  https://en.wikipedia.org/wiki/Bayer_filter
  There are alternatives such as what Fuji does with its X-trans sensor filter.
  https://en.wikipedia.org/wiki/Fujifilm_X-Trans_sensor
  Another alternative is Foveon (owned by Sigma now) which makes full color pixel sensors but they have not kept up with state of the art.
  https://en.wikipedia.org/wiki/Foveon_X3_sensor
  This is also why Leica B&W sensor cameras have higher apparently sharpness & ISO sensitivity than the related color sensor models because there is no filter in front or software interpolation happening.
  [-]
  - XCSme 59 minutes ago
    What about taking 3 photos while quickly changing the filter (e.g. filters are something like quantum dots that can be turned on/off)?
    [-]
    - lidavidm 14 minutes ago
      Olympus and other cameras can do this with "pixel shift": it uses the stabilization mechanism to quickly move the sensor by 1 pixel.
      https://en.wikipedia.org/wiki/Pixel_shift
      EDIT: Sigma also has "Foveon" sensors that do not have the filter and instead stacks multiple sensors (for different wavelengths) at each pixel.
      https://en.wikipedia.org/wiki/Foveon_X3_sensor
    - MarkusWandel 37 minutes ago
      Works for static images, but if there's motion the "changing the filters" part is never fast enough, there will always be colour fringing somewhere.
      Edit or maybe it does work? I've watched at least one movie on a DLP type video projector with sequential colour and not noticed colour fringing. But still photos have much higher demand here.
    - itishappy 50 minutes ago
      > What about taking 3 photos while quickly changing the filter
      Works great. Most astro shots are taken using a monochrome sensor and filter wheel.
      > filters are something like quantum dots that can be turned on/off
      If anyone has this tech, plz let me know! Maybe an etalon?
      https://en.wikipedia.org/wiki/Fabry%E2%80%93P%C3%A9rot_inter...
      [-]
      - XCSme 42 minutes ago
        > If anyone has this tech, plz let me know!
        I have no idea, it was my first thought when I thought of modern color filters.
        [-]
        card_zero 28 minutes ago
        That's how the earliest color photography worked. "Making color separations by reloading the camera and changing the filter between exposures was inconvenient", notes Wikipedia.
        [-]
        to11mtm 12 minutes ago
        I think they are both more asking about 'per pixel color filters'; that is, something like a sensor filter/glass but the color separators could change (at least 'per-line') fast enough to get a proper readout of the color in formation.
        AKA imagine a camera with R/G/B filters being quickly rotated out for 3 exposures, then imagine it again but the technology is integrated right into the sensor (and, ideally, the sensor and switching mechanism is fast enough to read out with rolling shutter competitive with modern ILCs)
  - stefan_ 51 minutes ago
    B&W sensors are generally more sensitive than their color versions, as all filters (going back to signal processing..) attenuate the signal.
- wtallis 1 hour ago
  The sensor outputs a single value per pixel. A later processing step is needed to interpret that data given knowledge about the color filter (usually Bayer pattern) in front of the sensor.
- i-am-gizm0 54 minutes ago
  The raw sensor output is a single value per sensor pixel, each of which is behind a red, green, or blue color filter. So to get a usable image (where each pixel has a value for all three colors), we have to somehow condense the values from some number of these sensor pixels. This is the "Debayering" process.
- ranger207 1 hour ago
  It's a single value per pixel, but each pixel has a different color filter in front of it, so it's effectively that each pixel is one of R, G, or B
  [-]
  - XCSme 1 hour ago
    So, for a 3x3 image, the input data would be 9 values like:
```
   R G B
   B R G
   G B R

?
```
    [-]
    - Lanzaa 25 minutes ago
      This depends on the camera and the sensor's bayer filter [0]. For example the quad bayer uses a 4x4 like:
      G G R R G G R R B B G G B B G G
      [0]: https://en.wikipedia.org/wiki/Bayer_filter
    - jeeyoungk 55 minutes ago
      If you want "3x3 colored image", you would need 6x6 of the bayer filter pixels.
      Each RGB pixel would be 2x2 grid of
``` G R B G ```
So G appears twice as many as other colors (this is mostly the same for both the screen and sensor technology).
There are different ways to do the color filter layouts for screens and sensors (Fuji X-Trans have different layout, for example).
- card_zero 58 minutes ago
  In the example ("let's color each pixel ...") the layout is:
  R G G B
  Then at a later stage the image is green because "There are twice as many green pixels in the filter matrix".
  [-]
  - nomel 24 minutes ago
    And this is important because our perception is more sensitive to luminance changes than color, and with our eyes being most sensitive to green, luminance is also. So, higher perceived spatial resolution by using more green [1]. This is also why JPG has lower resolution red and green channels, and why modern OLED usually use a pentile display, with only green being at full resolutio [2].
    [1] https://en.wikipedia.org/wiki/Bayer_filter#Explanation
    [2] https://en.wikipedia.org/wiki/PenTile_matrix_family
    [-]
    userbinator 10 minutes ago
    Pentile displays are acceptable for photos and videos, but look really horrible displaying text and fine detail --- which looks almost like what you'd see on an old triad-shadow-mask colour CRT.
    card_zero 14 minutes ago
    Funny that subpixels and camera sensors aren't using the same layouts.
Forgeties79 59 minutes ago
For those who are curious, this is basically what we do when we color grade in video production but taken to its most extreme. Or rather, stripped down to the most fundamental level. Lots of ways to describe it.
Generally we shoot “flat” (there are so many caveats to this but I don’t feel like getting bogged down in all of it. If you plan on getting down and dirty with colors and really grading, you generally shoot flat). The image that we handover to DIT/editing can be borderline grayscale in its appearance. The colors are so muted, the dynamic range is so wide, that you basically have a highly muted image. The reason for this is you then have the freedom to “push” the color and look and almost any direction, versus if you have a very saturated, high contrast image, you are more “locked” into that look. This matters more and more when you are using a compressed codec and not something with an incredibly high bitrate or raw codecs, which is a whole other world and I am also doing a bit of a disservice to by oversimplifying.
Though this being HN it is incredibly likely I am telling few to no people anything new here lol
[-]
- nospice 34 minutes ago
  "Flat" is a bit of a misnomer in this context. It's not flat, it's actually a logarithmic ("log profile") representation of data computed by the camera to allow a wider dynamic range to be squeezed into traditional video formats.
  It's sort of the opposite of what's going on with photography, where you have a dedicated "raw" format with linear readings from the sensor. Without these formats, someone would probably have invented "log JPEG" or something like that to preserve more data in highlights and in the shadows.
  [-]
gruez 42 minutes ago
Honestly, I think the gamma normalization step don't really count as "processing", any more than the gzip decompression step doesn't count as "processing" for the purposes of "this is what an unprocessed html file looks like" demo. At the end of the day, it's the same information, but encoded differently. Similar arguments can be made for de-bayer filter step. If you ignore these two steps, the "processing" that happens looks far less dramatic.
to11mtm 50 minutes ago
OK now do Fuji Super CCD (where for reasons unknown the RAW is diagonal [0])
[0] - https://en.wikipedia.org/wiki/Super_CCD#/media/File:Fuji_CCD...
[-]
- bri3d 27 minutes ago
  The reasons aren’t exactly unknown, considering that the sensor is diagonally oriented also?
  Processing these does seem like more fun though.
shepherdjerred 59 minutes ago
Wow this is amazing. What a good and simple explanation!
mvdtnz 51 minutes ago
The article keeps using the acronym "ADC" without defining it.
[-]
- packetslave 15 minutes ago
  Right-click, "Search Google for 'ADC'", takes much less time than making this useless comment.
  https://en.wikipedia.org/wiki/Analog-to-digital_converter
- benatkin 39 minutes ago
  There are also no citations, and it has this phrase "This website is not licensed for ML/LLM training or content creation." Yeah right, that's like the privacy notice posts people make to facebook from time to time that contradict the terms of service https://knowyourmeme.com/memes/facebook-privacy-notices
jiggawatts 12 minutes ago
I've been studying machine learning during the xmas break, and as an exercise I started tinkering around with the raw Bayer data from my Nikon camera, throwing it at various architectures to see what I can squeeze out of the sensor.
Something that surprised me is that very little of the computation photography magic that has been developed for mobile phones has been applied to larger DSLRs. Perhaps it's because it's not as desperately needed, or because prior to the current AI madness nobody had sufficient GPU power lying around for such a purpose.
For example, it's a relatively straightforward exercise to feed in "dark" and "flat" frames as extra per-pixel embeddings, which lets the model learn about the specifics of each individual sensor and its associated amplifier. In principle, this could allow not only better denoising, but also stretch the dynamic range a tiny bit by leveraging the less sensitive photosites in highlights and the more senstive ones in the dark areas.
Similarly, few if any photo editing products do simultaneous debayering and denoising, most do the latter as a step in normal RGB space.
Not to mention multi-frame stacking that compensates for camera motion, etc...
The whole area is "untapped" for full-frame cameras, someone just needs to throw a few server grade GPUs at the problem for a while!
killingtime74 1 hour ago
Very cool and detailed