MAXIM在FiveK上的图像修饰预训练模型

MAXIM模型为图像修饰而进行了预训练。它是由Zhengzhong Tu，Hossein Talebi，Han Zhang，Feng Yang，Peyman Milanfar，Alan Bovik，Yinxiao Li在论文 MAXIM: Multi-Axis MLP for Image Processing 中提出，并于 this repository 首次发布。

免责声明：发布MAXIM的团队没有为该模型编写模型卡，因此这个模型卡是由Hugging Face团队编写的。

模型描述

MAXIM引入了一个基于多层感知机（MLP）的共享骨干网络，用于不同的图像处理任务，如图像去模糊、去雨滴、降噪、去雾、低光图像增强和修饰。下图显示了MAXIM的主要组成部分。

训练过程和结果

作者没有发布训练代码。有关模型的训练细节，请参阅 original paper 。

根据 table ，该模型的峰值信噪比（PSNR）为26.15，结构相似性指数（SSIM）为0.945。

预期用途和限制

您可以将原始模型用于图像修饰任务。

该模型的 officially released in JAX 。它已经转换为TensorFlow格式（ this repository ）。

如何使用

以下是使用此模型的方法：

from huggingface_hub import from_pretrained_keras
from PIL import Image

import tensorflow as tf
import numpy as np
import requests

url = "https://github.com/sayakpaul/maxim-tf/raw/main/images/Enhancement/input/748.png"
image = Image.open(requests.get(url, stream=True).raw)
image = np.array(image)
image = tf.convert_to_tensor(image)
image = tf.image.resize(image, (256, 256))

model = from_pretrained_keras("google/maxim-s2-enhancement-fivek")
predictions = model.predict(tf.expand_dims(image, 0))

有关更详细的预测流程，请参阅 this Colab Notebook 。

引用

@article{tu2022maxim,
  title={MAXIM: Multi-Axis MLP for Image Processing},
  author={Tu, Zhengzhong and Talebi, Hossein and Zhang, Han and Yang, Feng and Milanfar, Peyman and Bovik, Alan and Li, Yinxiao},
  journal={CVPR},
  year={2022},
}

作者:

Google AI

数据集大小:

103.38 MB