Session Full Program · Contributors · Organizations · Search Program · Flagged · Happening NowMore…Search ProgramFlaggedHappening NowWork-in-Progress Poster: Tuesday Work-in-Progress PostersEvent TypeWork-in-Progress PosterTimeTuesday, June 256:00pm - 7:00pm PDTLocationLevel 2 LobbyTopicsAIAutonomous SystemsCloudDesignEDAEmbedded SystemsIPSecurityPresentationsHPA: A novel IS-WS hybrid data flow for PIM architecturesAuthorsYun ZhaoSheng MaHeng LiuLi HuangLi WuJian ZhangChun ZhangTie LiUnderstanding the Upper Bounds of Energy Efficiency in a Computing-in-Memory Processor and How to Approach the LimitAuthorsZhaori CongJinshan YueShengzhe YanZhuoyu DaiZeyu GuoZhihang QianYifan Hesun wenyuChunmeng DouFeng ZhangYongpan LiuAdditive Partial Sum QuantizationAuthorsPingcheng DongYonghao TanDong ZhangYongkun WuXijie HuangShi-Yang LiuYu LiuXuejiao LiuPeng LuoLuhong LiangFengwei AnKwang-Ting ChengFrom RTL to Prompt: AN LLM-assisted Verification Methodology for General ProcessorAuthorsYifei DengChao XiaoZhijie YangRenzhi ChenYuanfeng LuoJingyue ZhaoHuadong DaiLei WangYuhua TangWeixia XuAn Effective Timing Driven Placement with Accurate Differentiable Timing Approximation IntegrationAuthorsXu HeRenjun ZhaoChenjing YangYushan WangYao WangPeiyu LiaoYibo LinBei YuSPHINCSLET - A Lightweight Implementation of SPHINCS+AuthorsSanjay DeshpandeYongseok LeeCansu KarakuzuJakub SzeferYunheung PaekLearned Index Acceleration with FPGAs: A SMART ApproachAuthorGeetesh MoreTRIFP-DCIM: A Toggle-Rate-Immune Floating-point Digital Compute-in-Memory Design with Adaptive-Asymmetric Compute-TreeAuthorsXing WangTianhui JiaoYuchen MaZhican ZhangZhichao LiuXi ChenXin SiPixelPrune: Sparse Object Detection for AIoT Systems via In-Sensor Segmentation and Adaptive Data TransferAuthorsMohammadReza MohammadiMehrdad MorsaliBrendan ReidySepehr TabrizchiMohsen ImaniArman RoohiShaahin AngiziRamtin ZandScaler-FFT: A Scalable FPGA-based FFT Accelerator via General Matrix MultiplicationAuthorsSong ZhangZhiyuan MaZexu ZhangYueyin BaiKun WangAddressing the Diversity in AI Computing: An On-chip Programmable AcceleratorAuthorsGopikrishnan Raveendran NairFengyang JiangJeff ZhangJae-sun SeoYu CaoInteractive Visual Performance Space Exploration of Analog ICs with Neural Network Surrogate ModelsAuthorsYannick UhlmannTill MoldenhauerJürgen ScheibleELF: Efficient Logic Synthesis by Pruning Redundancy in RefactoringAuthorsDimitrios TsarasLei ChenXing LiWeihua ShengZhiyao XieMingxuan YuanA Quantum Solver for the Boolean Matching ProblemAuthorsMarco VenereAlessandro BarenghiGerardo PelosiTinySeg: Memory-efficient Image Segmentation for Small Embedded SystemsAuthorsByungchul ChaeJiae KimSeonyeong HeoPre-Silicon Power Side-channel Leakage Assessment of CRYSTALS-KyberAuthorsNashmin AlamTao ZhangFarimah FarahmandiTripartite Server Mutual Attestation: TEE-based BFT for Boosting Server Reliability in Federated LearningAuthorsYusen WuPhuong NguyenYelena YeshaPushing Computing-in-memory towards Computational Storage to Accelerate In-Orbit Remote Sensing Satellite Image ProcessingAuthorsHongyang HuShengwen LiangKai XiZizhen LiuJinshan YueDashan ShangXiaoxin XuJing LiuChunmeng DouMing LiuCDA: Collaborative Computing Using Centralized-Distributed Architecture for Smart SensingAuthorsErxiang RenCheng QuLi LuoYonghua LiZheyu LiuXinghua YangQi WeiFei QiaoAn Application of Information Flow Tracking to Hardware Trojan DetectionAuthorsRyoichi IsawaNobuyuki KanayaDaisuke InoueIntegrated MAC-based Systolic Arrays: Design and Performance EvaluationAuthorsDantu Nandini DeviGandi Ajay KumarBindu G GowdaMadhav RaoMethodology to define, design and support ultra-low voltage Digital DesignAuthorsArnab KhawasVarun IthalBadarish SubbnnavarAMARETTO: Enabling Efficient Quantum Algorithm Emulation on Low-Tier FPGAsAuthorsChristian ContiDeborah VolpeMariagrazia GrazianoMaurizio ZamboniGiovanna TurvaniDeputy NoC: A Case of Low Cost Network-on-Chip for Neural Network Accelerations on GPUsAuthorsKhoa HoSiamak BiglariJustin GarrigusHui ZhaoRADAR: A Skew-resistant and Hotness-aware Ordered Index Design for Processing-in-memory SystemsAuthorsYifan HuaShengan ZhengWeihan KongCong ZhouKaixin HuangRuoyan MaYifeng HuiLinpeng HuangQuantifying the Energy Efficiency Benefits of Monolithic 3D Refreshless Embedded-DRAMAuthorsDavid KongShvetank PrakashJedrzej KufelGeorgios KyriazidisYasmine OmriDavid VerityEmre OzerVijay Janapa ReddiGage HillsQuBound: An Efficient Workflow Enabling Prediction of Performance Bounds under Unpredictable Quantum NoiseAuthorsjinyang liSamudra DasguptaTravis HumbleWeiwen JiangNeuroSteiner: A Graph Transformer for Wirelength EstimationAuthorsSahil ManchandaDana KianfarMarkus PeschlRomain LepertMichael DefferrardSoCureLLM: An LLM-driven Approach for Large-Scale System-on-Chip Security Verification and Policy GenerationAuthorsShams TarekDipayan SahaSujan Kumar SahaMark TehranipoorFarimah FarahmandiA High-Throughput, Energy-Efficient, and Constant-Time In-SRAM AES Engine with Massively-Parallel Bit-Serial ExecutionAuthorsAndrew DervayWenfeng ZhaoThe chipSECS Hardware Trojan Benchmark Suite and Verification MethodologyAuthorChristian KriegAre Adversarial Examples Suitable To Be Test Suites for Testing Deep Neural NetworksAuthorWei KongAn Analytical Fidelity Model for Readout Circuitry with Multiple Co-Existing Non-Idealities for Superconducting Quantum ComputingAuthorsYao TongQuan ChenThe Power of Graph Signal Processing for Chip PlacementAuthorsYiting LiuHai ZhouJia WangFan YangXuan ZengLi ShangGPU-Accelerated BFS for Dynamic NetworksAuthorsFilippo ZicheRosalba GiugnoFederico BusatoNicola BombieriAiDAC: A Low-Cost In-Memory Computing Architecture with All-Analog Multibit Compute and InterconnectAuthorsZihao XuanSong ChenKang YiAutoFlow: Inferring Message Flows From System Communication TracesAuthorsBardia NadimiHao ZhengAdaptive Neurosurgeon: DNN Computing Latency Minimization for Mobile Edge IntelligenceAuthorsGang WuQianru WangBiao HuESFA: An Efficient Scalable FFT Accelerator Design Framework on Versal AI EngineAuthorsHao YangLinfeng DuWei ZhangExploring Distributed Circuit Design Using Single-Step Reinforcement LearningAuthorsJiayu LiMasood MortazaviNing YanEnabling Fast 2-bit LLM on GPUs: Memory Alignment, Sparse Outlier, and Asynchronous DequantizationAuthorsJinhao LiShiyao LiJiaming XuShan HuangJun LiuYaoxiu LianYu WangGuohao DaiFastSample: Accelerating Distributed Graph Neural Network Training for Billion-Scale GraphsAuthorsHesham MostafaAdam GrabowskiMd Asadullah TurjaJuan CervinoAlejandro RibeiroNageen HimayatOn Optimization of Robustness of Inter- and Intra-chiplet Interconnection Topology for Multi-chiplet SystemsAuthorsMiao XuXiaohang WangAmit Kumar SinghYingtao JiangMei YangMethodology of configurable memory conflict-free Number Theoretic Transform accelerator for FPGA platformAuthorsXiangchen MengYangdi LyuCooling the Chaos: Mitigating the Effect of Threshold Voltage Variation in Cryogenic CMOS MemoriesAuthorsRakshith SaligramAmol GaidhaneSuman DattaYu CaoArijit RaychowdhuryA Synthesis Methodology for Intelligent Memory Interfaces in Accelerator SystemsAuthorsAnkur LimayeNicolas Bohm AgostiniClaudio BaroneVito Giovanni CastellanaMichele FioritoFabrizio FerrandiAndres MarquezAntonino TumeoA Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband ProcessingAuthorsLimin JiangYi ShiHaiqin HuQingyu DengSiyi XuYintao LiuFeng YuanSi WangYihao ShenFangfang YeShan CaoZhiyuan JiangScalable Multi-task Deep Inference on Resource Constrained Energy Harvesting SystemAuthorsSahidul IslamBin LeiShanglin ZhouChen PanCaiwen DingMimi XieAnalytical Modeling and Electro-Thermal Benchmarking of 2.5D/3D Heterogeneous Integration for AI ComputingAuthorsZhenyu WangJingbo SunA. Alper GoksoySumit MandalJae-sun SeoVidya A. ChhabriaJeff ZhangChaitali ChakrabartiUmit OgrasYu CaoFrom RTL to SVA: LLM-assisted generation of Formal Verification TestbenchesAuthorsMarcelo Orenes-VeraMargaret MartonosiDavid WentzlaffRepresentation-Independent Resubstitution for Area-Oriented Logic OptimizationAuthorsAndrea CostamagnaAlessandro Tempia CalvinoAlan MishchenkoSatrajit ChatterjeeSiang-Yun LeeGiovanni De MicheliArchitectural Exploration of Application-Specific Resonant SRAM Compute-in-Memory (rCiM)AuthorsDhandeep ChallagundlaIgnatius BezzamRiadul IslamODILO: On-Device Incremental Learning Via Lightweight OperationsAuthorsQing WangDi LiuShengfa MiaoMingxiong ZhaoOptimal Toffoli-Depth Quantum AdderAuthorsSiyi WangAnkit MondalAnupam ChattopadhyayQuantization Noise Cancellation Through Modelling of Non-Linearities in Sigma Delta ModulatorsAuthorsStijn RingelingMarco FattoriShagun BajoriaRobert RuttenLucien BreemsEugenio CantatorePrinciples for Enabling TEEs on Domain-Specific AcceleratorsAuthorsAritra DharSupraja SridharaShweta ShindeSrdjan CapkunRenzo AndriEnhancing Performance of Deep Neural Networks with a Reduced Retention-Time MRAM-Based Memory ArchitectureAuthorsMunhyung LeeTaehan LeeJunwon YeoHyukjun LeeTDM: Time and Distance based Metric for Quantifying Information Leakage Vulnerabilities in SoCsAuthorsAvinash AyalasomayajulaHasan Al-ShaikhHenian LiSujan SahaFarimah FarahmandiWhere and How to Charge: Effective Charging with Mobile Agent in Wireless Powered CPSAuthorsChenchen FuZining ZhouSujunjie SunWeiwei WuSong HanSI-Aware Wire Timing Prediction at Pre-Routing Stage with Multi-Corner ConsiderationAuthorsXu HeYushan WangRenjun ZhaoYao WangChang LiuYang GuoOperational Safety in Human-in-the-loop Human-in-the-plant Autonomous SystemsAuthorsAyan BanerjeeAranyak MaityImane LamraniSandeep GuptaHyft: A Reconfigurable Softmax Accelerator with Hybrid Numeric Format for both Training and InferenceAuthorsTianhua XiaSai Qian ZhangOptimizing Homomorphic Convolution for Private CNN InferenceAuthorsHyeri RohWoo-Seok ChoiBayesian learning-driven Memory Design Exploration with Automated Circuit Variant GenerationAuthorsDongho KimSeokhun KimJunseo LeeHongwon KimSangheon LeeJihwan ParkHanwool JeongA Hardware-Aware Framework for Practical Quantum Circuit KnittingAuthorsXiangyu RenMengyu ZhangShengyu ZhangYicong ZhengAntonio BarbalaceFEI: Fusion Processing of Sensing Energy and Information for Self-sustainable Infrared Smart Vision SystemAuthorsXin HongHaijin SuMaimaiti NazhamaitiCe ZhangJunkai HuangYuzhao YangLi LuoQi WeiZheyu LiuFei QiaoSFQ counter-based precomputation for large-scale cryogenic VQE machinesAuthorsYosuke UenoSatoshi ImamuraYuna TomidaTeruo TanimotoMasamitsu TanakaYutaka TabuchiKoji InoueHiroshi NakamuraVisionHD: Revisiting Hyperdimensional Computing for Improved Image ClassificationAuthorsFatemeh AsgarinejadJustin MorrisTajana RosingBaris AksanliEscaping local optima in global placementAuthorsKe XueXi LinYunqi ShiShixiong KaiSiyuan XuChao QianPIANIST: Efficient Quantum Circuit Simulation using Commercial Processing-in-Memory SystemAuthorsDongin LeeEnhyeok JangSeungwoo ChoiJunwoong AnCheolhwan KimWon Woo RoLabidus: Productive Accelerator Development via Configurable Soft ProcessorsAuthorsGongjin SunSeongyoung KangJane HeSang-Woo JunApprox-T: Design Methodology for Approximate Multiplication Units via Taylor-expansionAuthorsShang-shang YAOQing-jie LANGLi ShenKai LUNeuCore: A Novel Neuromorphic Processor Architecture with On-chip Event-driven LearningAuthorsYi Weizhijie YangXun Xiaoxiangyu Wangjunbo Tiejingyue ZhaoLei Wanghuadong Daiweixia Xuyuhua Tangzhenhua ZhuCellRejuvo: Rescuing the Aging of 3D NAND Flash Cells with Dense-Sparse Cell ReprogrammingAuthorsHan-Yu LiaoYi-Shen ChenJen-Wei HsiehYuan-Hao ChangHung-Pin ChenHardware-Accelerated Optimization of DSP-Based Equalizer in High-Speed ADC-Based ReceiversAuthorsYoona LeeHanseok KimJin-Seok HeoWoo-Seok ChoiA Parallel-trial Double-update Annealing Algorithm for Enabling Highly-effective State Transition on Annealing ProcessorsAuthorsAkira HyodoSatoru JimboDaiki OkonogiGenta InoueThiem ChuMasato MotomuraKazushi KawamuraEfficient Synaptic Delay Acceleration in Digital Event-Driven Neuromorphic ProcessorsAuthorsRoy MeijerPaul DettererAmirreza YousefzadehAlberto Patiño-SaucedoGuangzhi TangKanishkan VadivelYingfu XuMario KonijnenburgManil Dev GomonyFederico CorradiManolis SifalakisB-Ring:An Efficient Interleaved Bidirectional Ring All-reduce Algorithm for Gradient SynchronizationAuthorsRuixing ZongJiapeng ZhangGuoqing XiaoZhuo TangKenli LiA novel method to analysis the wafer defect patterns using an image matching algorithm based on deep neural networksAuthorsYoungwook KwonSumin OhHyunJin KimMulti-modal Signal applied Neuromorphic proven SNN Model for Stress DetectionAuthorsAjay BSMadhav RaoSASDynabLE: A Compact Transformer Inference Architecture with Saturation-Approximate Softmax Enabling Dynamic-Mapping Based Layer-Fusion ExecutionAuthorsLiu HeZongle HuangYujin WangShupei FanTang ChenHuazhong YangYongpan LiuHongyang JiaReset Domain Crossing Design Verification Closure using Advanced Data Analytics TechniquesAuthorsREETIKA REETIKASulabh KhareMatHE: A Near-Mat Processing In-Memory Accelerator for Fully Homomorphic EncryptionAuthorsMinxuan ZhouYujin NamPranav GangwarWeihong XuArpan DuttaChris WilkersonRosario CammarotaSaransh GuptaTajana RosingAthena: Add More Intelligence to RMT-based Network Data Plane with Low-bit QuantizationAuthorsYunkun LiaoHanyue LinJingya WuWenyan LuXiaowei LiGuihai YannvmXR: Design Space Exploration of Non-Volatile Memory Architectures for Edge-XR SystemsAuthorsZihan ZhangMarco DonatoLibra: Collaborating with Basis-Inverted Circuits to Mitigate State-Dependent Errors on NISQ ProgramsAuthorsEnhyeok JangYoungmin KimWon Woo RoLEAP: Layout aware Estimation of Analog design ParasiticsAuthorsPrasanth MangalagiriSiddhartha JoshiGNN-Opt: Enhancing Automated Circuit Design Optimization with Graph Neural NetworksAuthorsKazuya YamamotoNobukazu TakaiNavigating the Challenges of Statistical Fault Injection in SRAM-FPGAAuthorsTrishna RajkumarJohnny ObergA General Purpose IMC Architecture with ADC-Awared Neural NetworksAuthorsMin-Gwon SongShin-Uk KangMin-Seong ChooMulti-modal Signal applied Dynamic neuron based Spike processor for Stress DetectionAuthorsAjay BSPhani PavanMadhav RaoBalancing and Minimizing Energy Consumption of Federated Learning in Heterogeneous Mobile Edge IoTAuthorsZehao JuTongquan Wei